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Abstract. Let P be a set of n points in the plane, not all on a line. 
We show that if n is large then there are at least n/2 ordinary lines, 
that is to say lines passing through exactly two points of P. This 
confirms, for large n, a conjecture of Dirac and Motzkin. In fact we 
describe the exact extremisers for this problem, as well as all sets 
having fewer than n — C ordinary lines for some absolute constant 
C. We also solve, for large n, the "orchard-planting problem", 
which asks for the maximum number of lines through exactly 3 
points of P. Underlying these results is a structure theorem which 
states that if P has at most Kn ordinary lines then all but O(K) 
points of P lie on a cubic curve, if n is sufficiently large depending 
on K, 
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1. Introduction 

The Sylv ester- Galai theorem is a well-known theorem in combinato- 
rial geometry It was proven by Gallai |18j in response to a question of 
Sylvester [35] from forty years earlier. 
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QUESTION'S FOR SOLTJ TIOU. 
11 851. fProfeiianr SlXTBSTKiL,)— Vtcyb that it is not po«ilik to 
arranga nny fluiti; number of rcnl points SO thut n right Jiiid^ tbroiiKlL 
every two of them shall pass through a third* utiIuw they all lie in the 
same right line. 

Figure 1. Sylvester's question [35] . 

Theorem 1.1 (Sylvester-Gallai theorem). Suppose that P is a finite 
set of points in the plane, not all on one line. Then there is an ordinary 
line spanned by P, that is to say a line in P containing exactly two 
points. 

Several different proofs of this now appear in the literature. We 
will be particularly interested in a proof due to Melchior [26] based on 
projective duality and Euler's formula, which we will recall in Section 
[3} It is natural to wonder how many ordinary lines there are in a set 
of P points, not all on a line, when the cardinality \P\ of P is equal 
to n. Melchior's argument in fact shows that there are at least three 
ordinary lines, but considerably more is known. Motzkin [27] was the 
first to obtain a bound (of order n 1 / 2 ) tending to infinity with n. Kelly 
and Moser [23] proved that there are at least 3n/7 ordinary lines, and 
Csima and Sawyer [11J improved this to 6n/13 when n > 7. Their 
work used ideas from the thesis of Hansen [20], which purported to 
prove the n/2 lower bound but was apparently flawed. An illuminating 
discussion of this point may be found in the MathSciNet review of 
There are several nice surveys on this and related problems; see pE], 
PH Chapter 17], [2H] or [29]. 

One of our main objectives in this paper is clarify this issue for 
large n. The following theorem resolves, for large n, a long-standing 
conjecture which has been known as the Dirac-Motzkin conjecture. Ap- 
parently neither author formally conjectures this in print, though Dirac 
[T2] twice states that its truth is "likely" . Motzkin [27] does not seem 
to mention it at all. 

Theorem 1.2 (Dirac-Motzkin conjecture). Suppose that P is a finite 
set of n points in the plane, not all on one line. Suppose that n ^ n 
for a sufficiently large absolute constant n . Then P spans at least n/2 
ordinary lines. 

We will in fact establish a more precise result obtaining the exact 
minimum for all n ^ n as well as a classification of the extremal 
examples. One rather curious feature of this more precise result is that 
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2. Fain would I plpnt a grove in rows, 
But how must I its form compose 

With three trees in tach row ; 
To have as many rows as trees j 
Now tell me, artists, if you please; 

'Tis all I want to know. 

Figure 2. Jackson's question [22"] . 



if n is odd there must be at least 3n/4 — 0(1) ordinary lines. See 
Theorems 2.2 and |2.4 below for more details. When n is even, one can 



attain n/2 ordinary lines by combining n/2 equally spaced points on a 



circle with n/2 points at infinity; see Proposition 2.1 below. 

For small values of n, there are exceptional configurations with fewer 
than n/2 ordinary lines. Kelly and Moser [23J observe that a triangle 
together with the midpoints of its sides and its centroid has n = 7 and 
just 3 ordinary lines. Crowe and McKee [TO] provide a more compli- 
cated configuration with n = 13 and 6 ordinary lines. It is possible 



that Theorem 1.2 remains true for all n with the exception of these 
two examples (or equivalently, one could take no as low as 14). Unfor- 
tunately our method does not give a good bound for n Q ; we could in 
principle compute such a bound, but it would be of double exponential 
type and, we think, not worth the effort. 

Our methods also apply (in fact with considerably less effort) to 
resolve sufficiently large instances of a slightly less well-known (but 
considerably older) problem referred to in the literature as the orchard 
problem. This was first formally posed by Sylvester [31] in 1868 (Figure 
[3]), though the 1821 book of Jackson [22] has a whole section containing 
puzzles of a similar flavour, posed more eloquently than any result in 
this paper (Figure [2]). 

Theorem 1.3 (Orchard problem). Suppose that P is a finite set of n 
points in the plane. Suppose that n ^ no for some sufficiently large 
absolute constant no. Then there are no more than \n(n — 3)/6j + 1 
lines that are 3-rich, that is they contain precisely 3 points of P. (Here 
and in the sequel, \_x\ denotes the integer part of x.) 

This theorem is tight for large n, as noted by Sylvester [31]. See also 
Burr, Griinbaum and Sloane [5]. We will give these examples, which 



are based on irreducible cubic curves, in Proposition 2J3 below. Again, 
there are counterexamples for small n. In particular, the example of 
a triangle, the midpoints of its sides and its centroid has n = 7 but 6 
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©UESTIOHS FOB, SGIrireSOXf, 
2572. (Proposed by Professor Stltestee.)-1. If It, ?.-. ere the dis- 
tinct prune Factors of „ prove that the number of points (which I term pin- 
periect points of the »th order) at which a cubic curve can have the biXst 
degree of contact mth a curve of tl.e decree n (not composed of wpetSioiK 
of a curve of lower dejrree) is Sn- (I -7<--) (X - jfr-5) (1_ A) 

2. Show that if the first tree in the solution to Quest. 2478 ( Zeprint, Vol. 
MIL, p. 106) be planted at a pluperfect point of the nth order in a cubic, 
the sequence of toe-marks I, 2. 4, 5, 7... may be replaced bv recurring 
periods of 3«. numerals, and that the two halves of each period will consist of 
the same n numerals arranged in reverse order, that in fact onjv the first n 
of the numerals 1,2,4.... need appear in the result. 

«. Hence prove that n trees may be so arranged as to contain between 
tt«E^(»-I)] -E{ 4 »}row8 of three in a row, »kt E (the symbol 
of entirety) denotes that the integer part only is to hetahen of the function 
WuiCfl it governs, 

I 1\ T 0TE.- Thus for SI trees the number will be 1053 instead of SOO, the 
number obtained when the hrst tree is at a noivpluperfeet point; and for IS 
trees the numoeiMS 30 instead of 26, the number stated in the Editorial 
reference at the end of the solution above cited as applicable to 15 points 
t ! T ? Ct ' ae , M ? te fe according to the formulated 
above It must, however, be observed, thatthis formula, does not in general 
pre the absolute maximum. Thus, for n = 9 the formula gives W 9 
rows, whereas the true maximum (dealing with real trees) is 10, In fact, 
E-^iK^-l)] is the arithmetical limit obtained by dividing the number 
of duads of » by 3, the number of them in each row. Tims, E I kt is the dif- 
ference between this limit and the actual number of the formula Which may 
also be expressed as equal to $(»«_S n ) when this is an iute'e- and the 
integer nnmechately superior to it when it is fractional. Thus we ee in n 
moment that with 1000 trees, the number of rows eontah n't, 4ein ^ 
row may be made equal to 16616". In a word, when the first tree i 'pti e l 
atanerumary point in the cubic, the number of rows given by the &te 
in Quest. 2473 is K{j(«-I)=], when at a pluperfect point the number is 
E{i(n-a)(„-s>}. 
Another method of placing points on a cubic which s > £ . s 10 rows for 9 

f&^SSg? 1111 cases tl,e trae *>™ *» 

Figure 3. Sylvester's question |33] . 

lines containing precisely three points of P; by contrast, the bound of 
Theorem 11.31 is 5 in this case. 

Underlying the proof of both of these results are structure theorems 
for sets with few ordinary lines, which are perhaps of independent inter- 
est. The most basic such result is the following. We use the asymptotic 
notation X = 0(Y) or X <C Y to denote the bound \X\ ^ CY for some 
absolute constant C. 

Theorem 1.4 (Weak structure theorem). Suppose that P is a finite 
set of n points in the plane. Suppose that P spans at most Kn ordinary 
lines for some K ^ 1. Suppose also that n ^ expexp^fT ) for some 
sufficiently large absolute constant C . Then all but at most 0(K°^) 
points of P lie on an algebraic curve 7 of degree at most 3. 

In fact we establish a slightly more precise statement, see Proposition 



6.13 below. Note that we do not require the algebraic curve 7 to be 



irreducible; thus 7 could be an irreducible cubic, the union of a conic 
and a line, or the union of three lines. As we shall see in later sections, 
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cubic curves arise naturally in the study of point-line configurations 
with few ordinary lines, in large part due to the well-known abelian 
group structure (or pseudo-group structure) defined by the collinearity 
relation on such curves (or equivalently, by Chasles's version of the 



Cayley-Bacharach theorem, see Proposition |4.1 ). The lower bound 
n ^ exp exp(CK c ) is present for rather artificial reasons, and can 
likely be improved substantially. 

Projective geometry. Much of the paper is best phrased in the lan- 
guage of projective geometry. We recall for the convenience of the 
reader the notion of the projective plane MP 2 as (IR 3 \ {0})/ ~, where 
(x, y, z) ~ (x', y', z') if and only if there is some A ^ such that 
x' = Xx, y' = Xy and z' = Xz. We denote points of IRP 2 with square 
brackets, thus [x,y, z] is the equivalence class of (x,y,z) under ~. We 
have the embedding IR 2 > MP 2 given by (x, y) t— > [x, y, 1]; in fact MP 2 
may be thought of as IR 2 together with the line at infinity consisting of 
points [x,?/,0] (modulo the equivalence relation ~). For the point-line 
incidence problems considered in this paper, the projective and afline 
formulations are equivalent. Indeed given a finite set of points P in 
MP 2 , we may apply a generic projective transformation so as to move 
all points of P to the affine plane IR 2 if desired, without affecting the 
number of ordinary lines or 3-rich lines. This is illustrated in the figures 
in Section [U 

For our purposes, there are two main advantages of working in pro- 
jective space instead of afline space. The first is to allow the use of 
projective transformations to normalise one's geometric configurations, 
for instance by moving a line to the line at infinity, or transforming a 
non-singular irreducible cubic curve into an elliptic curve in Weierstrass 
normal form. The other main advantage is the ability to utilise point- 
line duality. Given a point p = [a,b,c\, one may associate the line 
p* := {[x,y,z] : ax + by + cz = 0}, and conversely given a projective 
line £ = {[x,y,z] : ax + by + cz = 0} one may associate the point 
£* = [a, b, c\. It is clear that p G £ if and only if £* G p*. Working in the 
dual can provide us with information that is difficult to access other- 
wise. We shall see this twice: once in Sections [3] and [4], when we apply 
Euler's formula in the dual setting following an argument of Melchior 
[26], and then again in Section [6] where we will employ a convexity 
argument, due to Luke Alexander Betts, in the dual setting. 

Next, we give a structure theorem which is more precise than Theo- 
rem H31 
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Theorem 1.5 (Full structure theorem). Suppose that P is a finite set 
of n points in the projective plane MP 2 . Suppose that P spans at most 
Kn ordinary lines. Suppose also that n ^ expexp(CK c ) for some 
sufficiently large absolute constant C . Then, after applying a projective 
transformation if necessary, P differs by at most 0(K) points (which 
can be added or deleted) from an example of one of the following three 
types: 

(i) n — 0(K) points on a line; 

(ii) The set 

2irj . 2?rj 

X 2m ■= { cos ,sm , 1 : ^ j < m\ 

m m 

7T 7 7T 7 

U { -sin— ,cos — ,01,0 < j < m\ (1.1) 
m m 

consisting of m points on the unit circle and m points on the 
line at infinity, for some m — | + 0(K); 

(iii) A coset H © g , 3g G H , of a finite subgroup H of the non- 
singular real points on an irreducible cubic curve, with H hav- 
ing cardinality n + 0(K) (the group law © on such curves is 
reviewed in Section^ below). 



Conversely, every set of this type has at most 0(Kn) ordinary lines. 

We have the following consequence, which can handle slowly growing 
values of K. 

Corollary 1.6. Suppose that P is a finite set of n points in the pro- 
jective plane MP 2 . Suppose that P spans at most n(loglogn) c ordinary 
lines for some sufficiently small constant c > 0. Then, after applying a 
projective transformation, P differs by at most o(n) points from one of 



the examples ii), (ii), (iii) detailed in Theorem 1.5 above. In particular 
we may add/remove o(n) points to/from P to get a set with at most 
n + 0(1) ordinary lines. 

Here, of course, o(n) denotes a quantity which, after dividing by n, 
tends to zero as n goes to infinity. 

Remark. This corollary may, for all we know, be true with a much 
weaker assumption, perhaps even that P spans o(n 2 ) ordinary lines. 
Certainly, if one could weaken the hypothesis n ^ exp exp(CK c ) in 
Theorem 11.41 then one could do so also in Theorem 11.51 
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Proof methods. As mentioned previously, the starting pointQ of our 
arguments will be Melchior's proof |26j of the Sylvester-Gallai theorem 
using duality and the Euler formula V — E + F = 1 for polygonal 
decompositions of the projective plane MP 2 . Melchior's argument uses 
at one point the obvious fact that all polygons have at least three sides 
to obtain an inequality implying the existence of ordinary lines. The 
same argument also shows that if a point set P spans very few ordinary 
lines, then almost all of the polygons in the dual configuration T p (cut 
out by the dual lines p* for p G P) must in fact have exactly three 
sides. Because of this, it is possible to show in this case that the 
dual configuration contains large regions which have the combinatorial 
structure of a regular triangular grid. 

The next key observation is that inside any triangular grid of non- 
trivial size, one can find "hexagonal" configurations of lines and points 
(see Figure 14 ) which are dual to the configuration of lines and points 
arising in Chasles's version of the Cayley-Bacharach theorem (Propo- 



sition 4.1 below). From this observation and some elementary combi- 



natorial arguments, one can start placing large subsets of P on a single 
cubic curve. For instance, in Proposition 5A_ we will be able to estab- 
lish a "cheap structure theorem" asserting that a set of n points with 
fewer than Kn ordinary lines can be covered by no more than 500fT 
cubic curves. This observation turns out not to be new - a closely re- 
lated technique is used in a paper of Carnicer and Godes [7] concerning 
generalised principal lattices, which arise in interpolation theory. 

In principle, this cheap structure theorem already reduces the under- 
lying geometry from a two-dimensional one (the projective plane MP 2 ) 
to a one-dimensional one (the union of a number of cubic curves) . Un- 
fortunately, the collinearity relation between distinct cubic curves is 
too complicated to handle directly. Because of this, we must refine the 
previous combinatorial analysis to strengthen the structural control on 
a point set P with few ordinary lines. By studying the lines connecting 
a typical point p in P with all the other points in P using the triangular 
grid analysis, one can obtain a more complicated partition of P into 
cubic curves passing through p. A detailed statement may be found in 



Lemma 5.2 Comparing such a partition with the reference partition 



coming from the cheap structure theorem, one can obtain Proposition 



5.3, a structure theorem of intermediate strength. Roughly speaking 



this result asserts that most of the points in P lie on a single irreducible 



One defect of this approach is that it breaks down totally in the complex case, 
and so we have nothing new to say here about ordinary lines or 3-rich lines for 
configurations of complex lines in CP 2 . 
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cubic curve, on a union of an irreducible conic and a bounded number 
of lines (with the points shared almost evenly between the conic and 
the lines) , or on the union of a bounded number of lines only. 

The next stage is to cut the number of li nes involved down to one. 
The key propositior^jhere is Proposition It asserts, roughly speak- 
ing, that a set of n points on two or more lines, which contains 3> n 
points on each line, must generate ^> n 2 ordinary lines. This is a state- 
ment that fails in finite field geometries, and must use at some point the 
torsion-free nature of the real line 1R (see Section [2] for more discussion 
of this point, particularly with regards to the "near-counterexamples" 
(2.1 ) and ( 2.2[ )). There are two key cases of the proposition which need 
to be established. The first is when the lines involved are all concur- 
rent or, after a projective transformation, all parallel. In this case we 
use an argument of Betts, Proposition GA, involving projective dual- 
ity and convexity. The use of convexity here is where the torsion-free 
nature of R is implicitly used. In the case when the lines are not con- 
current, we instead rely on Menelaus's theorem to introduce various 
ratios of lengths, and then exploit a sum-product estimate of Elekes, 
Nathanson, and Ruzsa [14J. This latter result, stated in Proposition 



A. 8 also implicitly exploits the torsion-free nature of 



The result of the above analysis is a yet stronger structure theorem 
for sets P with few ordinary lines: P is mostly placed in either an 
irreducible cubic curve, the union of an irreducible conic and a line, 
or on a single line. A detailed statement may be found in Proposition 



6.13 The latter case, in which almost all points lie on a line, is easily 
studied. To deal with the other two cases one uses the abelian group 
structure on irreducible cubics, as well as the analogous pseudo- group 
structure on the union of a conic and a line. The information that P 
contains few ordinary lines can then be converted to an additive combi- 
natorics property on finite subsets of an abelian group. Fairly standard 
tools from additive combinatorics then show that P is almost a finite 
subgroup of that abelian group. This allows us to rule out "essentially 
torsion-free" situations, such as that provided by singular cubic curves 
(except for the acnodal singular cubic curve), and eventually leads to 
the full structure theorem in Theorem 11.51 

To solve the Dirac-Motzkin conjecture and the orchard problem for 
large n, we observe that potential counterexamples P to either con- 
jecture will have few ordinary lines and hence can be described by 



Unfortunately, our reduction to this key proposition is somewhat expensive with 
regards to the quantitative bounds, and is responsible for the double exponential 
lower bound required on n. 
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Theorem |1.5[ This quickly implies that P is close, up to projective 
transformation, to one of the known extremisers coming from roots of 
unity or from subgroups of elliptic curves, with a small number of addi- 
tional points added or removed. The remaining task is to compute the 
effect that these added/removed points have on the number of ordinary 
lines or 3-rich lines in P. Here, to get the strongest results, we will need 
a slight variant of a result of Poonen and Rubinstein [3U] in order to 
control the number of times a point may be concurrent with two roots 



of unity. See Proposition |7.5| for details. Of the two problems, the 
orchard problem turns out to be somewhat easier, and can be in fact 
established using only the intermediate structure theorem in Theorem 
15.31 rather than the more difficult structure theorem in Theorem 11.51 
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grateful to Noga Alon, Boris Bukh, Frando Mariacci, Bjorn Poonen 
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Frank de Zeeuw for pointing out the case of the acnodal singular cubic 
curve, which was not treated properly in the first draft of this paper. 
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pitality when another portion of this work was completed. The first 
author was supported by an ERC Starting Investigators Grant. The 
second author was partially supported by a Simons Investigator award 
from the Simons Foundation and by NSF grant DMS-0649473. 

2. The key examples 

The aim of this section is to describe the various key examples of 
sets with few ordinary lines or many 3-rich lines. In particular we 



describe the sets A 2m in (1.1 ) and the Sylvester examples appearing in 



the various cases of the main structure theorem, Theorem 1.5 We will 
also mention some important "near-counterexamples" which do not 
actually exist as finite counterexamples to the structure theorem, but 
nevertheless are close enough to genuine counterexamples that some 
attention must be given in the analysis to explicitly exclude variants 
of these examples from the list of possible configurations. All of these 
examples are connected to the group structure on a cubic curve, or 
equivalently to Chasles's version of the Cayley-Bacharach theorem as 



described in Proposition 4A below. The main variation in the examples 
comes from the nature of the cubic curve being considered, which may 
or may not be irreducible and/or nonsingular. 
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Boroczky and near-Bdrdczky examples. We begin with the sets X 



2m 

from (1.1), together with some slight perturbations of these sets de- 
scribed by Boroczky (as cited in [10] )• These sets, it turns out, provide 
the examples of non-collinear sets of n points with the fewest number 
of ordinary lines, at least for n large. 

Proposition 2.1 (Boroczky examples). Letm ^ 2 be an integer. Then 
we have the following. 

(i) The set X2 m contains 2m points and spans precisely m ordinary 
lines. 

(ii) The set X^ together with the origin [0, 0, 1] contains Am + 1 
points and spans precisely 3m ordinary lines. 

(iii) The set X 4m minus the point [0,1,0] on the line at infinity 
contains Am — 1 points and spans precisely 3m — 3 ordinary 
lines. 

(iv) The set X 4m+2 minus any of the 2m + 1 points on the line at 
infinity contains Am + 1 points and spans 3m ordinary lines. 

Thus, if we define a function f : N — > N by setting /(2m) := m, 
/(4m + 1) := 3m and /(4m — 1) := 3m — 3, then there is an example 
of a set of n points in MP 2 , not all on a line, spanning f(n) ordinary 
points. 

Proof. This is a rather straightforward check, especially once one 
has drawn suitable pictures. Whilst the unit circle together with the 
line at infinity form a pleasant context for calculational work, drawing 
configurations involving the line at infinity is problematic. In the four 
diagrams below, Figures [4], [5j [6] and [7j we have applied a projective 
transformation to aid visualisation. First of all we applied a rotation 
about the origin through 7r/12, and followed this by the projective 
map [x, y, z] !->■ [— y, x,2z + x). The unit circle is then sent to the 
ellipse whose equation in the afline plane is Ax 2 + 3(y + |) 2 = |, while 
the line at infinity is sent to the horizontal line y — 1. The origin is 
mapped to itself, and the point [0, 1, 0] at infinity now has coordinates 
(— cot(7r/12), 1) m (—3.73,1). In the pictures, ordinary lines are red 
and lines with three or more points of P are dotted green. 
It is helpful to note that the line joining 

r 2ttj . 2nj 

cos , sm , 1 

m m 



and 



r 27TJ' . 2<Kj> 

cos , sm , 1 

m m 
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m 



m 



on the line at infinity (cf. the proof of Proposition 7.3) 



For case (i), the ordinary lines are the m tangent lines to the m th 
roots of unity The case m = 6 is depicted in Figure [4j 



FIGURE 4. The Boroczky example a set with n = 
12 points and 6 ordinary lines. The ordinary lines (in 
red) are just the tangent lines to the 6th roots of unity 
on the unit circle. 

In case (ii), the ordinary lines are the 2m tangent lines to the 2m th 
roots of unity together with the m lines joining the origin [0, 0, 11 to 
[— sin cos 0], j even. The case m = 6 is depicted in Figure 5^ . 

In case (iii), the ordinary lines are the 2m tangent lines to the 2m th 
root of unity except for those at the points [±1,0,1], whose corre- 
sponding point at infinity has now been removed. However we do have 
m — 1 new ordinary lines, the vertical lines joining [cos — , sin 1] and 
[cos n ( 2m ~fi sin 7r ( 2 J~ m ) x] f or j — \ m — 1. The case m = 6 is 
illustrated in Figure |6j 

Finally, in case (iv) the ordinary lines are the 2m + 1 tangent lines 
to the (2m + l) th roots of unity except for one whose corresponding 
point at infinity has been removed, together with m new ordinary lines 
joining pairs of roots of unity. This is illustrated in Figure [7] in the case 
m = 2. 

□ 



We remark that Proposition 2A_ illustrates a basic fact, namely that 
if one adds or removes K points to an n-point configuration, then 
the number of ordinary lines (or 3-rich lines) is modified by at most 
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Figure 5. The Boroczky example X12 together with the 
origin [0,0, 1], a set with n — 13 points and 9 ordinary 
lines. The ordinary lines (in red) are just the tangent 
lines to the 6th roots of unity on the unit circle, plus 3 
extra lines through the origin and 3 of the points on the 
line at infinity. 



Figure 6. The Boroczky example X 12 minus the point 
at infinity [0,1,0], a set with n — 11 points and 6 or- 
dinary lines. The ordinary lines (in red) are the 4 tan- 
gent lines to the 6th roots of unity on the unit circle not 
through the point at infinity, plus 2 extra lines passing 
through the point at infinity. 

0(Kn + K 2 ); this can be seen by first considering the K — 1 case 
and then iterating. This stability with respect to addition or deletion 
of a few points is reflected in the conclusions of the various structural 
theorems in this paper. 

We may now state our more precise version of the Dirac-Motzkin 
conjecture for large n. 
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FIGURE 7. The Boroczky example Xiq minus the point 
at infinity [0, 1,0], a set with n = 9 points and 6 ordinary 
lines. The ordinary lines (in red) are the 4 tangent lines 
to the 6th roots of unity on the unit circle not through 
the point at infinity, plus 2 extra lines passing through 
the point at infinity. 

Theorem 2.2 (Sharp threshold for Dirac-Motzkin) . Let the function 
f : N — > N be defined by setting /(2m) := m, /(4m + 1) := 3m and 
/(4m — 1) := 3m — 3. There is an hq such that the following is true. If 
n ^ no and if P is a set of n points in WP 2 , not all on a line, then P 
spans at least f(n) ordinary lines. Furthermore if equality occurs then, 
up to a projective transformation, P is one of the Boroczky examples 
described in Proposition \2.1\ above. 



Remark. Note in particular that there is an essentially unique ex- 
tremal example unless n = l(mod 4), in which case there are two, 
namely examples (ii) and (iv) above. Note that all of the examples in 
(iv) are equivalent up to rotation. 

Let us record, in addition to the Boroczky examples mentioned in 



Proposition 2.1 the following near-extremal example. 



Proposition 2.3 (Near-Boroczky example). The set X^ m minus the 
point [— sin cos 0] on the line at infinity contains 4m — 1 points 
and spans 3m ordinary lines. 

Proof. This is illustrated in Figure [8] in the case m = 3. The ordinary 
lines are the 2m tangent lines to the 2m th roots of unity as well as 
m lines joining [cos — , sin — , ll and [cos — , sin — , 11 with ?' + f 

• ol rra' m'J L m" m'J j • j 

l(mod 2m). □ 



We may now state a still more precise result, which asserts that 
all configurations not equivalent to one of the above examples must 



14 



BEN GREEN AND TERENCE TAO 




Figure 8. The near-Boroczky example with n — 11 
points and 9 ordinary lines. The ordinary lines (in red) 
are the 6 tangent lines to the 6th roots of unity on the 
unit circle plus 3 lines passing through the removed point 
[- sin|,cos|,0]. 



necessarily have a significantly larger number of ordinary lines than 
f(n), when n is large. In fact there must be at least n — 0(1) ordinary 
lines in such cases. 

Theorem 2.4 (Strong Dirac-Motzkin conjecture). There is an absolute 
constant C such that the following is true. If P is a set of n points in 
MP 2 , not all on a line, spanning no more than n — C ordinary lines 
then P is equivalent under a projective transformation to one of the 
Boroczky examples or to a near-Boroczky example. 

The threshold n — C is sharp except for the constant C. Indeed we 
will shortly see that finite subgroups of elliptic curves of cardinality n 
give examples of sets with n — 0(1) lines. This gives infinitely many 
new examples of sets with few ordinary lines which are inequivalent 
under projective transformation due to the projective invariance of the 
discriminant of an elliptic curve. 

Sylvester's cubic curve examples. We turn now to Sylvester's exam- 
ples of point sets coming from cubic curves, as further discussed by 
Burr, Grunbaum and Sloane j5]. While these do not provide the best 
examples of sets with few ordinary lines, it appears that consideration 
of them is essential in order to solve the Dirac-Motzkin problem. Of 
course, they also feature in the statement of our main structural result, 
Theorem 1.5, and are optimal for the orchard problem (see Section [9]). 
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y 2 = x 3 U 2 = x 2 (x + 1) y 2 = x 2 (x — 1) 

cuspidal nodal acnodal 

Figure 9. The three different types of singular cubic curve. 

Finally, they provide essentially different examples of sets with n+0(l) 
ordinary points to any of those considered so far. 

For a leisurely discussion of all the projective algebraic geometry 
required in this paper, including an extensive discussion of cubic curves, 
we recommend the book 

Let 7 be any irreducible cubic curve. It is known (see [3J Chapter 
12]) that 7 has a point of inflection, that is to say a point where the 
tangent meets 7 to order 3). By moving this to the point [0,1,0] at 
infinity, we may bring 7 into the form y 2 = f(x) in affine coordinates, 
where f(x) is a cubic polynomial. If 7 is smooth then it is called an 
elliptic curve. An elliptic curve may have one or two components; these 



two cases are illustrated in Figure [10] If 7 has a singular point then it 
may be transformed into one of the following three (affine) forms: 

• (nodal case) y 2 = x 2 (x + 1); 

• (cuspidal case) y 2 = x 3 ; 

• (acnodal case) y 2 = x 2 (x — 1). 

See [3j Theorem 8.3] for details. These three singular cases are illus- 
trated in Figure |9j 

We remark that the classification of cubic curves over K has a long 
an honourable history dating back to Isaac Newton. 

The group law. Suppose that 7 is an irreducible cubic curve, and 
write 7* for the set of nonsingular points of 7. If 7 is smooth then of 
course 7 = 7*, and in this case we say that 7 is an elliptic curve. We 
may define an abelian group structure on 7* by taking the identity O to 
be this point of inflection on 7* and, roughly speaking, P @Q ® R = O 
if and only if P,Q,R are collinear. The "roughly speaking" refers to 
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the fact that we must take appropriate account of multiplicity, thus 
P '®P ' @Q = O if the tangent to 7 at P passes through Q. The inverse 
QP of P is defined using the fact that QP, O and P are collinear. See 
[21 Chapter 9] for more details, including a proof that this does indeed 
give 7* the structure of an abelian group. 

We have the following theorem regarding the nature of 7* as a group. 

Theorem 2.5. Let 7 be an irreducible cubic curve, and let 7* be the 
set of its nonsingular points. Then we have the following possibilities 
for'-y*, considered as a group. 

• (elliptic curve case) R/Z or M/ZxZ/2Z ; depending on whether 
7 has 1 or 2 components; 

• (nodal case) R x Z/2Z; 

• (cuspidal case) R; 

• (acnodal case) R/Z. 

Once again, details may be found in [3]. Thinking about the curves 
topologically, the theorem is reasonably evident. In the three singular 
cases isomorphisms <fi : G — > 7* can be given quite explicitly, as detailed 
in the following list. 

• In the nodal case y 2 = x 2 (x + 1), the map : R x Z/2Z — » 7* 
defined by 0(x, e) = (t 2 — 1, t(t 2 — 1)), where t = cothx if e = 
and t = tanhx if e = 1 provides an isomorphism; 

• In the cuspidal case y 2 = x 3 , the map : R — > 7* defined by 
4>(x) = (s, -\) provides an isomorphism; 

• In the acnodal case y 2 = x 2 (x — 1), the map <ft '■ R/Z — > 7* 
defined by <p(x) = (t 2 + l, t(t 2 + l)), where t = cot(7rx), provides 
an isomorphism. 

We leave the reader to provide the details. In the nodal case (for 
example) we recommend first proving that (t 2 — l,t(t 2 — 1), (u 2 — 
l,u(u 2 — 1)) and (v 2 — l,v(v 2 — 1)) are collinear if and only if —v = 
(l + tu)/(t + u). 

The following maps in the other direction, described in Silverman 
[32| III. 7] (the acnodal case is described in j32j Exercise 3.15]) are 
perhaps even tidier. Here 00 = [0, 1, 0]. 

• In the nodal case the map (x, y) i-> (y — x)/(y + x) and 00 h-> 1 
gives an isomorphism from 7* to R* = R x Z/2Z; 

• In the cuspidal case the map (x,y) h-> x/y and 00 h-» 1 gives 
an isomorphism from 7* to R; 

• In the acnodal case the map (x, y) h-> —(x + iy) 2 /x 3 and 00 h-> 
1 gives an isomorphism from 7* to the unit circle S l in the 
complex plane. 
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y 2 — x 3 — x y 2 = x 3 — x + 1 

Figure 10. Two elliptic curves, illustrating the group 
law and showing the two possibilites for the group struc- 
ture. 



Sylvester's examples. By a Sylvester example E n we mean a set of 
n points P in the plane which corresponds to a subgroup of order n of 
an irreducible cubic curve 7. If n > 2 the existence of such an example 
requires 7 to be either an elliptic curve or an acnodaj^] cubic curve, by 



the classification of the group structure of 7 described in Theorem 2.5 
A Sylvester example coming from an elliptic curve is depicted in Figure 

EU 

As it turns out, Sylvester examples have somewhat more ordinary 
lines than the Boroczky examples, namely n+0(l) instead of n/2+0(l) 
or 3n/4 + 0(1), and are thus not extremises for the Dirac-Motzkin 
conjecture. However, due to the more evenly distributed nature of the 
Sylvester examples, they have significantly more 3-rich lines. Indeed, 
the following is essentially established in [5J. 

Proposition 2.6. Let n ^ 3, and let E n be a subgroup of order n in 7* , 
the group of nonsingular points of an irreducible cubic curve 7 (which 
must be an elliptic curve or an acnodal cubic). Then E n contains 
n — 1 — 21 3 | n ordinary lines and [ n(n 6 ~ 3 ) j 4- 1 3-rich lines, where l 3 | n is 
equal to 1 when 3 divides n and zero otherwise. Furthermore, if x G E 
is such that x (jL E n and x@x@x G E n (which can only occur when n is 



3 We do not know whether Sylvester himself was interested in the acnodal case. We 
thank Frank de Zeeuw for correcting an oversight in the first version of this paper 
by drawing it to our attention. 



18 



BEN GREEN AND TERENCE TAO 




Figure 11. A Sylvester example with n = 8, the sub- 
group being isomorphic to Z/2Z x Z/4Z. The labels 
reflect the group structure, thus 03 corresponds to the 
element (0, 3) G Z/2Z x Z/4Z. This comes from an ellip- 
tic curve with equation y 2 = x 3 — ^x 2 — ^x + j||g = 
to which we have applied the projective transformation 
[x, y, z] i — y [x, y, x + y + z], so that the point at infin- 
ity maps to the point (0, 1) in the affine plane (which is 
then an inflection point for the curve). The are 7 ordi- 
nary lines, marked in red, and also 7 3-rich lines, marked 
in dotted green. 



divisible by 3, by Legendre's theorem), then E n © x has n — 1 ordinary 
lines and [ "( n ~ 3 ) j 3-rich lines. 

Proof. Let N2 be the number of ordinary lines, and ./V3 be the number 
of 3-rich lines. From Bezout's theorem no line can meet E n in more 
than three points, and so by double counting we have the identity 




A brief computation (splitting into three cases depending on the residue 
of n modulo 3) then shows that A^3 = |_ e; J + 1 if and only if N2 = 
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n — 1 — 21.3| n . But from the group law the number of ordinary lines 
is precisely equal to the number of elements a G E n such that —2a 
is distinct from a, or in other words the number n of elements in E n 
minus the number of third roots in E n . But 7* is isomorphic as a group 
to either R/Z or (R/Z) x (Z/2Z), and so E n is isomorphic to either 
Z/nZ or to (Z/(ra/2)Z) x (Z/2Z). It has 1 + 21 3]n third roots in either 
case, and the claim follows. 

The analysis in the shifted case E n © x is analogous, the only dif- 
ference being that E n © x does not contain any third roots of unity. 
□ 



Remarks. For the sake of comparison, the n-point examples in Propo- 
sition |2Tl can all be computed to have n 2 /8 + 0(n) 3-rich lines instead 



of n 2 /6 + 0(n) for the Sylvester examples. This discrepancy can be 
explained by the existence of a high-multiplicity line with n/2 + 0(1) 
points in those examples. This absorbs many of the pairs of points that 
could otherwise be generating 3-rich lines. 

We note also that the acnodal case allows for a quite explicit con- 
struction of a set of n points defining ~ n 2 /6 3-rich lines, without the 
use of Weierstrass jp-function which would be necessary in the ellip- 
tic curve case. We leave the reader to supply the details, using the 



parametrisation detailed after the statement of Theorem 2.5 We are 



not sure whether this point has been raised in the literature before. 

Near- counterexamples. In addition to the actual examples coming 
from Boroczky's constructions and from elliptic curve subgroups, there 
are also some important "near-counterexamples" which do not directly 
enter into the analysis (because they involve an infinite number of 
points, rather than a finite number), but which nevertheless appear to 
indirectly complicate the analysis by potentially generating spurious 
counterexamples to the structural theory of points with few ordinary 
lines. These then need to be eliminated by additional arguments. 

As with the previously discussed examples, the near-counterexamples 
discussed here will lie on cubic curves. But whilst the actual examples 
were on an elliptic curve, an acnodal singular cubic curve, or on the 
union of a conic and a line, the near-counterexamples will lie on three 
lines (which may or may not be concurrent), or on a non-acnodal sin- 
gular cubic curve. 
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We first consider a near-counterexample on three concurrent lines. 
Up to projective transformation, one can take the lines to be the par- 
allel lines 

{[zi,0,l] :xi GR}U {[1,0,0]} 
{[2;2,1,1] : x 2 GR}U {[1,0,0]} 
{[x 3 ,2,l]:x 3 GR}U{[l,0,0]}. 

Observe that [xi, 0, 1], [X2, 1, 1] and [a; 3 , 2, 1] are concurrent if and only 
if x\ + 23 = 2x2- Thus, if we consider the infinite point set 

P : = {[m, 0, 1] : m G Z} U {[n 2 /2, 0, 1] :n 2 G §Z} 

U{[n 3 ,0,l] : n 3 G Z} (2.1) 

then there are no ordinary lines whatsoever; every line joining a point in 
PC\£i with a point in P^£ 2 meets a point in Pfl£ 3 , and similarly for per- 
mutations. If Z could somehow have a non-trivial finite subgroup, then 
one could truncate this example into a counterexample to the Sylvester- 
Gallai theorem, i.e. a finite set with no ordinary lines. Of course, this 
cannot actually happen, but this example strongly suggests that one 
needs to somehow use the torsion-free nature of the additive group R 
at some point in the arguments, for instance by exploiting arguments 
based on convexity, or by using additive combinatorics results exploit- 
ing the ordered nature of R. One such example, a variant of which we 
prove in Lemma A. 3 is the trivial inequality |A + .B| ^ \A\ + \B\ — 1 for 
finite subsets A, B of R. This can be viewed as a quantitative version 
of the assertion that R has no non-trivial finite subgroups. 

There is a similar near-counterexample involving three non-concurrent 
lines. Again, after applying a projective transformation, we may work 
with the lines 

: {[x,0, 1] : x G R}U {[1,0,0]} 

:{[0 )2 /,l]:xGR}U {[0,1,0]} 

= {[-*, 1, 0] : 3/ eR}U {[1,0,0]}. 

Observe that if x,y,z G R x := R\{0}, then [x,0,l], [0,y,l] and 
[—z, 1,0] are concurrent precisely when z = x/y. Thus, if we consider 
the infinite point set 

P := {[2 ni , 0, 1] : m G Z} U {[0,2 n2 , 1] : n 2 G Z} 

U{[-2 fl3 ,l,0]:n 3 GZ} (2.2) 

then again there are no ordinary lines: every line joining a point in 
P C\£i with a point in P fl £2 meets a point in P R £3 , and similarly for 
permutations. As before, this example suggests that the torsion-free 
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nature of the multiplicative group M x must somehow come into play 
at some point in the argument. 

Finally, we give an example that lies on a cuspidal singular cubic 
curve, which after projective transformation can be written as 

7 := {[x,y,z] : yz 2 = x 3 }. 

Removing the singular point at [0, 1, 0], we may parameterise the smooth 
points 7* of this curve by { [t, t 3 , 1] : t G R}. One can compute af- 
ter a brief determinant computation that three distinct smooth points 
[ti,if,l], [t 2 ,t2)l] and ^3,^,1] on the curve are concurrent precisely 
when ti + t 2 + £3 = 0. Thus, if one sets P to be the infinite set 

P ■= {[n,n 3 , 1] : n G Z} (2.3) 

then there are very few ordinary lines - indeed only those lines that 
are tangent to 7 at one point [n, n 3 , 1] and meet 7 at a second point 
[(— 2n), (—2n) 3 , 1] for some n G Z\{0} will be ordinary. This example 
can be viewed as a degenerate limit of the Sylvester examples E n when 
the discriminant is sent to zero and n sent to infinity. Again, it can 
be ruled out, but only after one exploits the torsion- free nature of the 
group associated to 7*, which in this case is isomorphic to R. Similar 
remarks also apply to nodal singular cubic curves such as {[x, y, z] : 
y 2 z = x 3 + x 2 z}, the smooth points of which form a group isomorphic 
to R x (Z/2Z), which is essentially torsion free in the sense that there 
are no large finite subgroups. 



A variant of the example (2.3) lies on the union 



{[x,y,z] :yz = x } U {[x,y,z] : z = 0} 

of a parabola and the line at infinity. Observe that two points [t±, t\,l], 
\p2, t\, 1] on the parabola and a point [0, £3, 1] on the line at infinity with 
ti, £3 G R are concurrent if and only if £3 = t± + t 2 . Thus, the infinite 

set 

P := {[n, n 2 , 1] : n G Z} U {[0, n, 1] : n G Z}, (2.4) 

which can be viewed as a degenerate limit of a Boroczky example, has 
very few ordinary lines (namely, the line tangent to the parabola at 
one point [n,n 2 , 1] and also passing through [0,2n, 1]). 

The existence of these near-counterexamples forces us to use a some- 
what ad hoc case-by-case analysis. Tools such as Chasles's version of 
the Cayley-Bacharach theorem, which are valid for all cubic curves, get 
us only so far. They must be followed up by more specialised arguments 
exploiting the torsion or lack thereof in the group structure. In this 
way we can rule out near-counterexamples involving triples of lines, or 
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singular irreducible cubics, until only the Boroczky and Sylvester type 
of examples remain. 

3. Melchior's proof of the Sylvester-Gallai theorem 

In this section we review Melchior's proof |26j of the Sylvester-Gallai 
theorem. As mentioned in the introduction, this is the starting point 
for all of our arguments. 



Theorem 1.1 (Sylvester-Gallai, again). Suppose that P is a finite set 
of points in the plane, not all on one line. Then P spans at least one 
ordinary line. 

Proof. Let P be a set of n points in MP 2 . Consider the dual collection 
P* := {p* : p G P} of n lines in MP 2 . These lines determine a graph^] 
V p on IRP 2 whose vertices are the intersections of pairs of lines p\ , p\ (or 
equivalently points £*, where £ is a line joining two points of P), and 
whose edges are (projective) line segments of lines in P* connecting 
two vertices of Tp with no vertex in the interior. Note that as the 
points in P were assumed not to lie on one line, every line in P* must 
meet at least two vertices of Tp; in particular, the graph Tp contains 
no loops. (It is however possible for a line to meet exactly two vertices 
in Tp, in which case those two vertices are joined by two edges, rather 
than one.) Also, by construction, each vertex of Tp is incident to at 
least two lines in P*. As such, the graph Tp partitions the projective 
plane MP 2 into some number V of vertices, some number E of edges, 
and some number F of faces, each of which is the projective image of 
a polygon. In particular, each face has at least three edges, and any 
edge is incident to two distinct faces. 

By Euler's formula in the projective plane MP 2 we hava^] 

V-E + F = l. (3.1) 

To proceed further, suppose that for each k — 2, 3, 4, . . . the set P has 
Nk lines containing precisely k points of P. Then V, which by duality 
is the number of lines defined by pairs of points in P, satisfies 

n 

V = J2 N k- (3-2) 

k=2 



4 Strictly speaking, Tp determines a drawing of a graph in the projective plane, 
because we are viewing the edges as curves in MP 2 rather than abstract pairs of 
vertices, but we shall abuse notation by identifying a graph with its drawing. 
5 The Euler characteristic x(KJP 2 ) = 1 of the projective plane is of course half of the 
Euler characteristic x(S 2 ) = 2 of the sphere, as the latter is a double cover of the 
former. 
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Furthermore the degree d(£*) of a vertex £* in our graph is twice the 
number of lines in P* passing through £*, which is 2\P D £\. Thus, 
summing over all lines £, 

n 

2E = ^d{t) = 2^\Pr\£\ = ^2kN k . (3.3) 

£ I k=2 

Finally, for s = 3, 4, 5, . . . write M s for the number of faces in Tp with 
s edges. Since each edge is incident to exactly two faces, we have 

n 

2E = J2 sM s- (3-4) 

s=3 



Combining (3.1), (3.2), (3.3) and (3.4) gives the following expression 



for A^2, the number of ordinary lines: 

n n 

It follows immediately that N2 ^ 3, which of course implies the Sylvester- 
Gallai theorem. □ 

After discarding the non-negative term ^^ =4 (s — 3)M S , the equation 



(3.5) implies Melchior's inequality 

n 

N 2 > 3 + J2( k - 3 ) N k 



k=4 



In this paper, however, we will need to save the term Yl™=i( s ~ 3)M S , 
as it gives crucial control on the geometry of the dual configuration Tp, 
ensuring that this configuration resembles a triangulation when N2 is 
small. More precisely, we have the following proposition. 

Proposition 3.1 (Few bad edges). Suppose that P is a set of n points 
in the projective plane MP 2 , not all on a line, and suppose that P has 
at most Kn ordinary lines. Consider the planar graph Tp obtained by 
dualising P as described above. Then Tp is an "almost triangulation" 
in the following sense. Say that an edge ofT P is good if both of its 
vertices have degree 6, and if both faces it adjoins are triangles. Say 
that it is bad otherwise. Then the number of bad edges inTp is at most 
16Kn. 

Proof. From ( |3.5[ ) we have 

n n 

^sM s <4^(s-3)M s <4iV 2 <4i<:n. (3.6) 

s=4 s=4 



24 



BEN GREEN AND TERENCE TAO 



Secondly, let us observe that 

d (£*) ^ 12Kn - (3-7) 
To see this, recall that d(t) = 2\P n£\. We thus obtain 

Y d(t) = 2 i pn ^i 

e-.d{e*)>6 i-.\pne\>3 

= Y 2kN k < § ~ 3 ^ Nk ^ 8KU - 

Noting also that 

d(t) = 2 \P^e\=4N 2 ^AKn, 

i:d{l*)=i l:\PC\l\=2 



(3.7) follows. 

Now we can place an upper bound on the number B of bad edges. 
Each face with s > 3 sides gives s bad edges, and each vertex £* with 
degree d(£*) ^ 6 gives d(£*) bad edges. As these are the only sources 
of bad edges, we have 

s>3 £:d(t*)^6 



by (3.6) and (3.7). □ 



4. Triangular structure in the dual and cubic curves 

In general, the number of edges overall in Tp is expected to be of the 
order of n 2 (cf. Beck's theorem [2]). Thus, when K is small, Proposition 



3J] should be viewed as an assertion that almost all the edges of Tp are 
good. For instance, it shows that any dual line p*, p G P should contain 
at most O(K) bad edges on the average. Intuitively, this suggests that 
Tp is an "almost triangulation" in which most vertices have degree 6 
and most faces are triangles. In Section [5] we will use this information 
to put the points of P on a small number of cubic curves, which will 
be our starting point for more powerful structural theorems on P. 
By a cubic curve we mean a set of points in MP 2 of the form 

{[X, Y, Z] : ai X 3 + a 2 X 2 Y + a 3 XY 2 + a 4 Y 3 + a 5 X 2 Z 

+ a 6 XYZ + a 7 Y 2 Z + a 8 XZ 2 + a 9 YZ 2 + a 10 Z 3 = 0} 

for some coefficients a%, . . . , a\o G R, not all zero, or in other words the 
locus of a nontrivial homogeneous polynomial of degree 3. Note that 
we do not assume this polynomial to be irreducible. In particular, we 
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consider the union of three lines, as well as the union of a conic and a 
line, to be examples of cubic curves. 

A key observation in our arguments will be the fact that pockets of 
true triangular structure in the dual T signify a collection of points of 
P lying on a single cubic curve. Results of this type may be found 



in Lemmas 4.4 and 4.5 below. A key ingredient will be the following 
incredibly classical fact from projective geometry, usually known as the 
Cayley-Bacharach theorem (although the case we require was proven 
by Chasles [9], prior to the more general results of Cayley [8] and 
Bacharach PQ). 

Proposition 4.1 (Chasles). Suppose that two sets of three lines define 
nine distinct points of intersection in MP 2 as shown in Figure 12 below. 



Then any cubic curve passing through eight of these points also passes 
through the ninth. 

See [13J or the blog post |36j for a discussion of this result, including 
its link to Pappus's theorem, Pascal's theorem, and the associativity 
of the group l aw o n an elliptic curve. 

Proposition 4A allows one to establish some duality relationships^] 



between triangular grids and cubic curves. We first define what we 
mean by a triangular grid: 

Definition 4.2 (Triangular grid). Let /, J, K be three discrete intervals 
in Z {thus I takes the form {i G Z : i_ ^ i ^ i + } for some integers 
i-,i+, and similarly for J and K). A triangular grid with dimensions 
I, J,K is a collection of lines (p*)iei, (Qj)jeJ, ( r t)keK in MP 2 , which we 
will view as duals of not necessarily distinct points Pi,q-j,rk in WP 2 , 
obeying the following axioms: 

(i) Ifi£l,jE.J,k&K are integers with i + j + k = 0, then 
the lines p*,q*j,r^ are distinct and meet at a point P^. Fur- 
thermore, this point P^ is not incident to any line in the 
grid which is not already identical to one of the lines p\,q*j,r\. 
Thus, for instance, if %' G I is such that p*, ^ P*,Q*,rl, then 
p*, cannot contain Pijk. 

(ii) If i G I,j,f G J,k,k' G K are such that i + j + k = i+j' + 
k' — and < \j — j'\ ^ 2 (or equivalently < \k — k'\ ^ 
2), then the intersection points P^k and P^k' are distinct. In 
particular, this forces q* ^ q*, and r* k ^ r* k , . Similarly for cyclic 
permutations ofi,j,k and of j',k'. 



An example of a triangular grid is depicted in Figure 13 



Let us note once again that rather similar results were obtained earlier, in a com- 
pletely different context, by Carnicer and Godes [7]. 
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Figure 12. Chasles's theorem. There are two sets of 
three lines: the solid lines {p , ri}, {pi,qo, r -i}, 
{p-i,qo, r i} and the dotted lines {p , qi, r_i}, 
{pi,g_i,r } and qo, r{\. The nine points of 

intersection p 0) Pi ; <?-i) <?cb <?i) r -i ; r o an d ri are 
all distinct. Any cubic curve passing through 8 of these 
points also passes through the 9th; one such curve is 
shown. 



The following basic consequence of Proposition AA_ drives our whole 
argument. 

Lemma 4.3 (Completing a hexagon). Let io,jo,ko be integers with 
io+jo + ko = 0, let I := {i -l,i ,i + l} , J := {j -l, j , j + l} , K : = 
{k - 1, k , k Q + 1}, and let (pi) ie i, (qj)jeJ, (r k ) keK be triples of points 
whose duals form a triangular grid with dimensions I, J, K. Then the 
nine points (pj)i 6 /, (qj)jej, (r k ) keK are distinct, and any cubic curve 
which passes through eight of them passes through the ninth. 

Proof. By relabeling, we may assume that io = jo = k Q = 0, thus the 
nine points are p_i,po,Pi, Q-i, Qo, Qi, ?"q, Once it is shown that 
these nine points are distinct, their duals form a "hexagon" as depicted 



in Figure 14 below as part of a larger triangular grid. The configuration 



in Figure 14, however, is precisely the dual of the configuration of 9 



points appearing in Chasle's theorem (see Figure 12), and the claim 
then follows. 
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r W r 9 r 8 r 7 r l r 5 



r; rz r, r; 




9_i 9-g 9-g q- 7 q- 6 9- 5 9-4 9-3 2-2 9-i 



Figure 13. A triangular grid 
{-2,..., 2}, {-10,...,-!} and {1,. 



with dimensions 
.,10}. 



It remains to establish the distinctness of the nine points. By apply- 



ing Definition |4.2 (ii) to the intersections of p*,9o,T*j for i = —1,0, 1 
we see that the p_i,po,Pi are distinct; similarly for g_i,g ,<?i and 
r_i, ro, T\. Next, from Definition |4.2[ i) we see that pi and qj are distinct 
as long as — 1 ^ i + j ^ 1, and similarly for cyclic permutations. The 
only remaining claim left to check, up to permutations and reflections, 
is that pi and qi are distinct. But if these two points coincided, then 
the intersections of pl,qli,r^ and P*i,?*,?"o w °uld then also coincide, 
contradicting Definition |4.2[ii). □ 



We now iterate the above proposition. 

Lemma 4.4. Suppose that m ^ 4 is an integer and that are 
integers with 2 ^ i + ^ m — 2 and 2 — m ^ i_ ^ —1. Suppose that 
we have a collection of points (pi)i_^i + , (qj)- m ^j^-i and (rk)i^k^m in 
MP 2 whose duals form a triangular grid with the indicated dimensions 
(The case i_ = —2, i + = 2 and m = 10 is illustrated in Figure 13). 
Then all of the points Pi, qj, r^ lie on a single cubic curve 7. 

Proof. Consider the nine points p-\,po,pi,p2, qs, q~2, q~i, ?"2- The 
space of cubic homogeneous polynomials is a vector space of dimension 
10, and so by straightforward linear algebra there is a cubic curve 7 
containing these nine points P-i,Po, pi, P2, <?-3, q~2, Q-i, r i and r 2 . (Note 
that it is not necessary for the nine points to be distinct in order to 



28 



BEN GREEN AND TERENCE TAO 




Figure 14. The dual of the configuration in Chasles's 
theorem, showing a "hexagon" formed by the duals of 
two sets of three lines. The light grey lines are duals 



of other points on the cubic curve shown in Figure 12 



specifically those points in longer arithmetic progressions 
(in the group law on 7) containing the Pi, qj, r&. We have 
included them mainly for aesthetic interest, but also as 
a more complicated example of a triangular grid. 



obtain this claim.) We will now claim that all the remaining points 
Pi, qj,r k in the configuration also lie on 7. 
Firstly, by applying Lemma 4.3 to the set 

P-i,Po,Pi, 9-3, 9-2, q-i, r lt r 2 , r 3 

we see that as eight of the points already lie in 7, the ninth point r 3 
must also. If one then applies Lemma |4~3 to the sets 

Po,Pi,P2, g-3, 9-2, q-i, r 2 , r 3 , r 4 

and 

Po, Pi, P2, q-i, 9-3, q-2, ri, r 2 , r 3 
we conclude that g_4 and also lie in 7. By shifting the q indices down 
by one and r indices up by one repeatedly, we may then inductively 
place g_fc and in 7 for all 4 ^ k ^ m. Finally, by applying Lemma 



473] inductively to the sets 

Pi-i,Pi,Pi+x, q-i-3, q-1-2, q-i-i, n, r 2 , r 3 

for % = 2, . . . , i + — 1 (noting that — i — 3 ^ — i + — 2 ^ —m) we may 



place Pi in 7 for all 2 < i ^ i + , and similarly by applying Lemma 4.3 
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inductively to 

P-i-uP-U P-i+i, 1-3, 9-2, q-i,r i+1 ,r i+2 , r i+3 

for i = 1, . . . , — i- — 1 we can also place p-i in 7 for all 1 < i ^ — z_. 
This concludes the proof of the claim. □ 

This lemma is already enough to imply our most basic structural 



result for sets with few ordinary lines, Proposition 5.1 To get stronger 



results, such as Proposition 5.3, we need to perform a deeper analysis. 



The new feature in the following lemma is the last statement. 

Lemma 4.5. Suppose that L ^ 10 and that m ^ 10L. Suppose that 
we have a collection of 4L + 1 + 2m points (pi) -2L^2L , (qj)-m^j4-i 
and {rk)i^k^m i n B^P 2 whose duals form a (4L + 1) x m x m triangular 
grid with the indicated dimensions. Assume furthermore that the points 
Pi,Qj, r k ore all distinct. Then all of the points Pi,qj,r k lie on a single 
cubic curve 7, each irreducible component of which contains at least L 
of the points pi, qj, r k . 



Proof. Note from Definition 4.2 and the distinctness of the Pi,qj,rk 
that the intersection points = p*r\q*r\rl in the grid are all distinct. 

That all the Pi, qj, r k lie on a single cubic curve 7 follows from Lemma 
WM 

If 7 is already an irreducible cubic then we are done. By enlarging 7 
via the addition of extra lines if necessary, we may otherwise suppose 
that we are in one of the following two cases: 

Case 1: 7 is the union of three distinct lines £,£',£"; 

Case 2: 7 is the union of an irreducible conic a and a line I. 

In each case we are to show that all irreducible components contain 
at least L points pi, qj, r^. 

In Case 1, consider a triple of points Pi,qj,Tk with i + j + k = 0. 
Since Pi, qj, r^ are collinear and lie on i U £' U £", one of the following 
two possibilities holds: 

(i) One of the three lines £, £', £" is incident to all three of pi, q^r^ 
(i.e. the line {pi,qj,rk} is one of £,£', or £"); 

(ii) Pi,qj,r k lie on one of each of the lines £,£',£" (for instance, 
one could have Pi G £', qj G £", r k G £, or any of the other five 
possible permutations). Note that we allow a point to lie on 
more than one of the lines £, £', £". 

First of all note that (i) cannot hold for more than three triples 
(i,j,k) with i + j + k = 0. Indeed, as observed previously, the in- 
tersection points Pijk := p* PI q* fl r* k are all distinct, and so the lines 
containing {pi,qj,r k } are distinct for distinct triples (i,j,k). 
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Let fl be the set of triples k) with i + j + k = 0, — L ^ i ^ L, 
—2L < j < —L and L < k < 2L. Suppose that (i) holds for some 
triple k) G Q and that p«, g^, r k all lie on £ (say). We now consider 
the triples (i',f, k') G Q with % ^ %' , j ^ j', k ^ k'. With at most two 
exceptions, (ii) holds for any such triple. Fix one of these triples for 
which (ii) holds. One of the points Pi>,qj>,r k > then lies on £. Suppose 
that lies in £. But, noting that 1 ^ —i' — j ^ m, we see that 
Pi',qj and r^it_j are collinear and so r_j/_j lies on £ as well. That 
is, the lines containing {pi,qj,r k } and {pj/, qj, r_j/_j} are the same. 
This is a contradiction as we noted above. Similarly, if lies in £, 
then 1 ^ — i — j' ^ m and we can conclude that the lines containing 
{pi,qj,rk} and {pi, q^, r_j_j/} are again coincident, a contradiction. 
Finally, if ?v lies in £, then — m ^ — % — k' ^ 1 and the lines containing 
{Pi,qj,r k } and {pi, q-i-k>, r k '} are coincident, again a contradiction. 

It follows that, whenever (i,j,k) G Q, we are in case (ii) and not 
in (i), that is to say the points Pi,qj,r k lie on one of each of the lines 
£,£',£", but do not all lie on one of the lines £, £', or £". Suppose 
without loss of generality that po G £, g_2L G £', r 2 L G £". If q~2L+i G £" 
then the concurrent points g_2L+i,^2L all he in £"; as (—1, — 2L + 
1,2L) G f2, we obtain a contradiction. Similarly, if g_2L+i G £, then 
Po, ?-2L+i, f2L+i all lie in £, again a contradiction. Thus g_2L+i G 
which implies r2L-i G Repeating this argument we see that in fact 
all of the points qj, —2L ^ j < —L, lie on £' and all of the points r k , L < 
k ^ 2L, lie on £". Finally, considering the triple (i, — 2L — i, 2L) G Q, 
we see that all of the points Pi, ^ % < L, lie on £. We have established 
that each of the lines £, £', £" contains at least L of the points Pi, qj, r k , 
concluding the proof of the lemma in Case 1. 

We turn now to Case 2, where the argument is very similar. Consider 
once again a triple of points pi, qj, r k with i + j + k = 0. These lie on 
a line. By Bezout's theorem, there are two cases: 

(i) Pi,qj,r k all lie on l\ 

(ii) two of Pi, qj, r k lie on a and the other lies on £. 

If (i) ever holds for some triple k) then there is at most one such 
triple. Suppose it holds for some triple (i,j,k) G Q. There are again 
many triples (i',j',k') G f2 with i ^ i' , j ^ j', k ^ k'. For any 
such triple, two of py, qj/, r k > lie on a and the other lies on £. Suppose, 
without loss of generality, that pi> G £. Then, noting that 1 ^ —i'—j ^ 
m, we see that p^, qj, r^^-j are collinear and so lies on £ as well, 

and thus (i) also holds for the triple — j). This leads to a 

contradiction exactly as before. 
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It follows that, whenever (i,j,k) G Q, two of the points Pi,qj,r k lie 
on a and the other lies on £. If po G a then one of g_ j , Tj lies on a 
and the other lies on £, for each j with L < j ^ 2L. Thus both cr 
and £ contain at least L of the points Pi,qj,rk- If, on the other hand, 
Po G £, then all the q~j,rj with L < j ^ 2L lie on ex. But for each i, 
\i\ ^ L — 1, there are some j, A; with — 2L ^ j < —L and L < k ^ 2L 
such that (i,j,k) G f2, and so G £ for all these i too. Thus both £ 
and a contain at least L of the points Pi, qj,rj, in this case also. □ 

We remark that this analysis can be pushed further in order to say 
something about the distribution of the points p i: qj,r k on (for example) 
three lines £, £', £". One could most probably give some kind of complete 
classification of (say) 100 x 100 x 100 triangular grids. However it is 
also possible to take a self-contained additive-combinatorial approach, 
leading to better bounds, and this is the technique we pursue in Section 

El 

To conclude this section let us remark that a number of beautiful 
pictures of triangular structures arising from cubic curves (for various 
different types of cubic) may be found in the paper [UJ, another work 
in the interpolation theory literature. 

5. Almost triangular structure and covering by cubics 
Recall that if P C MP 2 is a set of points then Tp is the graph defined 



by the dual lines p*, p 6 P. We now know (Proposition 3.1) that if 



P has few ordinary lines then Tp has a highly triangular structure. 



We also understand (Lemma 4.4) that triangular structure in Tp cor- 
responds to points of P lying on a cubic curve. In this section we put 
these facts together to prove some of the structural results stated in 
the introduction. 



The main result is Lemma [572] , whose statement and proof are some- 
what technical. To convey the main idea (and because we will need it 
later, and because it may be of independent interest) we first establish 
the following much easier result. This result also comes with better 
bounds - indeed it says something even if one only knows that P spans 
o(n 2 ) ordinary lines - than our more technical later result. 

Proposition 5.1 (Cheap structure theorem). Suppose that P is a finite 
set of n points in the plane. Suppose that P spans at most Kn ordinary 
lines for some K ^ 1. Then P lies on the union of 500/T cubic curves. 

Proof. We first dispose of a degenerate case. Suppose that one of the 
dual lines p*, p G P, meets fewer than 500i^ points in Tp. Then every 
dual line meets one of these points, which means that P is covered by 
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at most 500K lines. As every line is already a cubic curve, we are done 
in this case. Thus we may assume that each dual line p* meets at least 
500K points in Tp. In particular, it meets at least three points of Tp. 

Recall the definition of a "good edge" of Tp: an edge both of whose 
vertices have degree 6, and where both faces adjoining it are triangles. 

Let us say that an edge is really good if all paths of length two 
from both of its endpoints consist entirely of good edges. If we have 
a segment S of I ^ 1 consecutive edges on p*, all of which are really 
good, then the structure of Tp is locally that of a triangular grid with 
dimensions {—2, . . . , 2}, {—I — 4, ... , —1}, {1, ...,/ + 4}; note that 
the distinct intersection property of Definition |4.2| (i) is automatic since 
every dual line is assumed to meet at least three points in Tp. Applying 
Lemma 4A , we conclude that if S is such a segment of consecutive really 
good edges, containing at least one edge, then the set of q G P \ {p} 
for which q* meets S all lie on a cubic curve 75- (which also contains 
p). 

If an edge is not really good, we say that it is somewhat bad. We 
know, by Proposition 3T that the number of bad edges is at most 
IQKn. Now associated to any somewhat bad edge e is a path of length 
1, 2 or 3 whose first edge is e and whose last edge is bad, and which 
is furthermore the only bad edge on that path (take a minimal path 
starting in e and ending in a bad edge). The number of paths of length 
3 of the form bad-good-good is at most IQKn x 5 x 5, since each vertex 
of a good edge has degree 6. Taking account of paths of length 2 
and 1 as well, we obtain an upper bound of 500Kn for the number of 
somewhat bad edges. 

By the pigeonhole principle there is a line p* which contains t ^ 5007^ 
somewhat bad edges. These somewhat bad edges partition p* into t 
segments of consecutive really good edges (a segment may have length 
zero). Let the segments with at least one edge be Si, . . . , Sf, and let the 
segments of length zero, which are simply vertices, consist of vertices 
v t '+i, ■ ■ ■ ,v t - 

If q G P \ {p}, then q* meets p* either in a vertex of one of the Si, 
or in one of the additional vertices Vj. In the former discussed 
previously, Lemma 4.4 places q in a cubic curve 7% depending on S^ 
In the latter case, then either q lies in the dual line v*. Such a dual line 
can be thought of as a (degenerate) cubic curve. Taking the union of 
all these cubic curves, of which there are at most £' + (£ — t') ^ 500if, 
gives the result. □ 



Proposition 5A_ is already a fairly strong structure theorem for sets 
with few ordinary lines. It is possible that the ordinary lines in a union 
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of 0(1) cubics can be analysed directly, though this certainly does not 
seem to be straightforward. Fortunately, there is much more to be 
extracted from Proposition 3^ and the results of Section |4j enabling 



us to prove more precise statements that refine Proposition 5.1 albeit 
with somewhat worse explicit constants. 

The next lemma is the main technical result of this section. In an 
effort to make the paper more readable, we have formulated it so that, 
once it is proven, we will have no further need of the dual graph Fp 
and consequences of Melchior's inequality. 

Lemma 5.2. Suppose that P is a set of n points in the plane. Suppose 
that P spans at most Kn ordinary lines for some K ^ 1, and let 
L ^ 10 be a parameter. Suppose that P cannot be covered by a collection 
of 4L concurrent lines. Then for every p G P there is a partition 
P = {p} U Ei jP U • • ■ U S Cp P with the following properties: 

(i) Fori = 1, . . . , c p the points o/X^p lie on a (not necessarily irre- 
ducible) curve 7i jP of degree at most three, which also contains 
p; 

(ii) //7i, p is not a line, then each irreducible component of it con- 
tains at least L points of P; 

(iii) // 7j )J3 is not a line, then the points of may be partitioned 
into pairs (q, r) such that p, q, r are collinear, and no other 
points of P are on the line joining p, q and r; 

(iv) We have the upper bound Y2 P ep c p ^ 2 19 L 3 Kn on the average 
size of c p . 

Proof. The proof of this lemma is basically the same as the last one, 
except now we work with a considerably enhanced notion of what it 
means to be a "really good edge" . Call an edge to be extremely good 
if all paths of length 2L from both of its endpoints consist entirely of 
good edges. In the last proposition, we only needed paths of length 2. 
If an edge e is not extremely good, let us say that it is slightly bad. 

We now count the number of slightly bad edges by an argument 
similar to that used to prove the previous proposition. Let e be a 
slightly bad edge, and let r be the length of the shortest path from an 
endpoint of e to a vertex of a bad edge, thus ^ r ^ 2L — 1, and 
there is a vertex v of a bad edge e' that is distance exactly r from an 
endpoint w of e. Then all paths of length up to r from either of the 
vertices of e are good, which means that the r-neighbourhood of e has 
the combinatorial structure of a triangular grid, and also that v lies on 
the boundary of this neighbourhood and has degree six. Among other 
things, this implies that among all the paths of length r from v to w, 
there is a path that changes direction only once. To describe this path, 
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as well as the slightly bad edge e, one could specify the the bad edge 
e', followed by an endpoint v of that bad edge of degree six, followed 
by an edge emenating from v, which is followed along for some length 
r\ to a vertex of degree six, at which point one switches to one of the 
other four available directions and follows that direction for a further 
length r 2 , with 7*1 + r 2 ^ r (so in particular ^ ri, r 2 ^ 2L — 1), until 
one reaches a vertex v, at which point the slightly bad edge e is one of 



the six edges adjacent to v. From Proposition 3.1 and simple counting 
arguments, we may thus bound the total number of bad edges e crudely 
by 

l6Kn x2x6x2Lx4x2Lx6^ 2 15 KL 2 

and so we conclude that the number of bad edges is at most 2 15 KL 2 . 
One could save a few powers of two here by being more careful, but we 
will not do so. 

Suppose that there are b p slightly bad edges on p*. Then 



■ p <: 2 15 L 2 Kn. (5.i; 



If we have a segment of m ^ 10L consecutive edges on p* , all of which 
are extremely good, and with p* containing at least 4L additional edges 
beyond these m, then the structure of Tp is locally that of a triangular 
grid of dimensions {— 2L, . . . , 2L}, {— m, . . . , — 1}, {1, . . . , m}. Note 
that as we are assuming that P cannot be covered by 4L concurrent 
lines, every dual line p* meets at least 4L + 1 distinct points in Tp, 
ensuring the disjointness property in Definition |4.2| (ii). Indeed, the fact 
that each dual line meets at least 4L + 1 distinct points, and that p* 
contains at least 4L additional edges beyond the m consecutive edges, 
ensures that all the lines in this triangular grid are distinct. 



Thus if 5* is such a segment then, by Lemma 4.5, the set £$ of all 
q G P \ {p} for which q* meets S all lie on a cubic curve 75 which 
contains p, and each component of which contains at least L points 
of P. Furthermore, since the lines q* meet the vertices of S in pairs 
(since each such vertex certainly has degree 6) the points of Eg may 
be divided into pairs (q, r) such that p, q, r are collinear, and no other 
point of P lies on the line joining p, q and r. Compare with conclusion 
(iii) of this lemma. 

The line p* is divided into b p segments, each containing one or more 
vertices, by the slightly bad edges. We then create some subsegments 
Si, . . . , S t by the following rule: 

(i) If p* contains at most 14L edges in all, then we set t = 0, so 
no subsegments Si, . . . ,S t are created; 
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(ii) If p* contains more than 14L edges, and one of the segments S 
cut out by the slightly bad edges contains all but at most 4L 
of the edges, we set t = 1, and define Si to be a subsegment of 
S omitting precisely 4L edges; 

(iii) In all other cases, we set Si, ... , S t be those segments cut out 
by the slightly bad edges with at least 10L edges. 

We then let v t+ i, . . . , v Cp be the vertices not contained in any of the 
Si, . . . , Sf By construction, we see that we always have t ^ b p and that 
the number c p —t of remaining vertices v j is at most max(14L, 4L, (10L+ 
l)b p ) < (10L + l)b p + UL. We thus have 

c p (WL + 2)b p + UL. (5.2) 

Define £j )P := and 7« iP := 7^ for i ^ t, and for i ^ t + 1 let 7^ 
be the line v* and take E i)P to consist of the points of P \ {p} lying on 
this line. 

This collection of cubics and lines has properties (i), (ii) and (iii) 



claimed in the lemma. The bound (iv) follows immediately from (5.1) 



and (5.2) and the crude bound (10 L + 2)2 15 L 2 K + 14L ^ 2 19 L 3 K, valid 



for L ^ 10 and K > 1. □ 

We are now in a position to prove a result which is still not quite 
as strong as our main structure theorem, Theorem 1.5[ but is still 



considerably more powerful (albeit with worse explicit constants) than 
the rather crude statement of Theorem 15.11 



Proposition 5.3 (Intermediate structure theorem). Suppose that P 
is a finite set of n points in the plane. Suppose that P spans at most 
Kn ordinary lines for some K ^ 1. Then one of the following three 
alternatives holds: 

(i) P lies on the union of an irreducible cubic 7 and an additional 
2 75 K 5 points. 

(ii) P lies on the union of an irreducible conic o and an additional 
2 64 fT 4 n lines. Furthermore, a contains between | — 2 W K 5 and 
|+2 76 i^ 5 points of P, and P\a spans at most2 m K A n ordinary 
lines. 

(iii) P lies on the union of 2 l6 K lines and an additional 2 87 K 6 
points. 

Remark. The explicit expressions such as 2 75 K 5 in the above propo- 
sition could of course be replaced by the less specific notation 0(K°^) 
if desired, and the reader may wish to do so in the proof below as well. 
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Proof. If P can be covered by 60000K ^ 2 K concurrent lines then 
we are of course done, so we will assume that this is not the case. 



By Proposition 5A we know that P is covered by at most 500-fT cubic 
curves of degree at most three. By breaking each of these curves up into 
irreducible components, we may thus cover P by distinct irreducible 
cubic curves 71, . . . , 7 m for some 

m ^ 1500K. (5.3) 

By Bezout's Theorem, no pair of distinct irreducible curves intersects 
in more than 9 points, and so there is a set P' C P, with 

\P\P'\^(™\ <: 2 24 K 2 , 

such that each point of P' lies on just one of the curves 

Suppose first of all that one of the 7,, say 71, is an irreducible cubic 
and contains at least 2 76 K 5 points of P. Then it also contains at least 
2 75 K 5 points of P' . Write n := \P' D 71 1: thus n > 2 75 K 5 . 

By construction and ( |5.3 ), P is not covered by 40m concurrent lines. 
Applying Lemma 5.2 with L := 10m, we see that for each p' G P 
we may partition P as {p 1 } U U • ■ ■ U £ c where Yli P '^p c p' ^ 
2 19 (10m) 3 i ; rn ^ 2 61 K 4 n and each Ej jP / is contained in some (not nec- 
essarily irreducible) cubic 7^/ containing p' which is either a line, or 
has the property that each irreducible component of it contains at least 
10m points of P. 

By the pigeonhole principle, we can find there is some p' G P' H 71 
with the property that c p i ^ 2 61 K A n/n . Fix this p'. By Bezout's 
theorem, an irreducible curve of degree at most three that is not already 
one of the 7, meets P in no more than 9m points, and so we infer that 
each 7jy is either a line, or else every irreducible component of it is 
one of the jj. Since p' lies on 71 but not on any other 7j, we infer 
that all the j itP ' are lines except that one of them, say 7iy, may be 71. 
Furthermore, none of the lines 7jy, j = 2, . . . ,c p r, which all contain p', 
coincides with any of 72, • • • ,7m- By another application of Bezout's 
theorem, each of them contains at most 3m points of P. 

It follows that 

V 

n = \P\ < l+|PH7i| + 5^|Pn7 iy | 

i=2 

2 74 K 5 n 

^ 1 + n + 2 24 K 2 + 3mc p/ ^ n H . 

n 
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Since no ^ 2 75 K 5 , we conclude that no #5 n/2, which when inserted 
again into the above inequality gives no ^ n — 2 75 K 5 , which is option 



(i) of Proposition 5.3 



The analysis of option (ii) goes along similar lines but is a little more 
complicated. Suppose now that one of the 7$, say 71, is an irreducible 
conic and contains at least 2 76 K 5 points of P. Once again, it also 
contains at least 2 75 K 5 points of P'. Write n := \P' D 7i|; thus n ^ 

2 75^5 

By Lemma 5.2 as before we may, for each p' G P, partition P as 
{p'} U E iy U ■ ■ ■ U S v y with ty < 2 19 (10m) 3 fSm ^ 2 61 K 4 n and 
and each Ej )P / contained in some (not necessarily irreducible) curve 7^ 
of degree at most three containing p' which is either a line, or has the 
property that each irreducible component contains at least 10m points 
of P. 

By the pigeonhole principle as before, we may find p' G P' H 71 such 
that cy ^ 2 61 K 4 n/n . Now fix this p'. 

Suppose that 7^ is not a line. Then, by Bezout's theorem as above, 
each irreducible component of 7^ is one of the 7,. Since p' G 7iy , and 
p' lies on 71 but not on any other 7^, one of the irreducible components 
of 7iy is 71. Thus for each i one of the following is true: 

(i) Sjy is contained in a line through p'\ 

(ii) is contained in the conic 71; 

(iii) Sj jP / is contained in the union of the conic 71 and a line 7^. 



Now recall Lemma 5.2, Item (iii) of that lemma asserts that in cases 
(ii) and (iii) above the points of may be divided into collinear 
triples (p',q,r). This immediately rules out option (ii). For those i 
satisfying (iii) we see that |£j iP '| = 2|E ijP / fl7i|. For those i satisfying 
(i) it follows from Bezout's theorem that |£j, p '| ^ 3m. 

Let / be the set of indices % for which Ejy is not contained in a line 
through p', that is to say for which option (iii) above holds. It follows 
that 



n 



\P\ = l + y2 \^i, P '\ < 1 + 2^ |E i)P / fl7i| + 3mc p 



However, any line through p' meets 71 (which contains p 1 ) in at most 
one other point, and so 

y, |Ejy n 71 1 < \P n 71 1 + Cp>. 

Since 

\P n7i| < \P' n 71 1 + \P\P'\ ^n + 2 U K 2 
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we conclude that 

2 u K 5 n 



n = 2n + 2 2b K 2 + (3m + l)cy + 1 ^ 2n + 

Since no ^ 2 75 K 5 , this is easily seen to imply that uq ^ n/4, and hence 
no ^ n/2 — 2 7e K b and c p / ^ 2 63 i^ 4 . In the converse direction, we have 



n ^ |S ijP /| =2^ n 7i| 



and so 



't ii 

|Pn 7 i| < - + i + |(J^yn 7 i|. 

For i $l I , Sjy fl 71 consists of at most one point, and so 

|Pn 7 i|^| + i + c p ^| + 2 76 ir 5 

and hence 71 contains between | — 2 76 K 5 and | — 2 76 J fT 5 elements of 
P. 

Looking back at the three possibilities (i), (ii) and (iii) above, we see 
that the other points lie in the union of the lines 7^ and jj, of which 
there are at most c p > + m ^ 2 6A K 4 . 

To complete the proof that we are in case (ii) claimed in the proposi- 
tion, we need to give an upper bound for the number of ordinary lines 
spanned by the set P \ 71. Such a line could be ordinary in P, but 
there are at most Kn such lines. Otherwise, such a line passes through 
a point p' G P fl 71, and contains precisely two points in P \ 71. Let 
us say that such a line is bad. The number of bad lines arising from 
p' G P \ P', a set of cardinality at most 2 24 K 2 } is at most 2 23 K 2 n. 
Suppose then that p' G P' fl 71. As above, for each such we have a 
partition P = {p'} U £iy U • • • U S c and we now know that Sjy is 
either a line through p' or is contained in the union of 71 and a line. 



Furthermore in the latter case we know from Lemma 5.2 (iii) that every 
line though p' and a point of £j jP / passes through precisely two other 
points of P, one on 71 and the other not. Therefore it is not bad. The 
number of bad lines through p' is thus at most c p /, and so the total num- 
ber of bad lines arising from p' G P fl 71 is at most Y2 P ' °pf ^ 2 61 K A n. 
Statement (ii) of the proposition follows immediately. 

We have now considered all cases in which any irreducible cubic or 
conic from amongst the m curves 7^ contains more than 2 76 K 5 points of 
P. If this is not the case, the only curves among the ji containing more 
than 2 7e K 5 points of P are lines. Thus P may be covered by m ^ 2 16 K 
lines and at most 2 76 K 5 m ^ 2 87 K 6 points, which gives option (iii). □ 
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6. Ordinary points in unions of lines 
Suppose that P is a set of n p oints spanning at most Kn ordinary 



lines. We know from Proposition 5.3 that all but 0(K°^) points of 



P lie on an irreducible cubic, an irreducible conic and some lines, or 
some lines. The aim of this section is to reduce the number of lines, in 
all cases, to 1. The main result of this section is the following theorem, 
which may again be of independent interest. 

Proposition 6.1. Suppose that a set P C IRP 2 of size n lies on a 
union l\ U • - • U £ m of lines, and that P spans at most Kn ordinary 
lines. Suppose that n ^ no(m,K) is sufficiently large. Then all except 
at most 3K of the points of P lie on a single line. 

We first handle the (easy) case where one line has almost all the 
points. For this, one does not need to know that P is contained in the 
union of a few lines. 

Lemma 6.2. Suppose that P C IRP 2 is a set of size n, that P spans at 
most Kn ordinary lines, and that at least |n of the points of P lie on 
a single line £. Then in fact all except at most 3K of the points lie on 
£. 

Proof. Let p G P\£. Then p forms at least 2n/3 lines with the points 
of P fl £. At most n/3 of these contain another point of P, and so at 
least n/3 of them are ordinary. Therefore the number of ordinary lines 
is at least \P \ £\n/3, and the claim follows immediately. □ 



m ■ 



Suppose now, and for the rest of the section, that P C £ x U • • • U t 
The opposite extreme to that considered by the above lemma is when 
all the lines li contain many points of P. The next result, which is the 



key technical step in the proof of Proposition 6.1 (and is in fact rather 



stronger than that proposition), may be of independent interest. 

Proposition 6.3. Suppose that m ^ 2 and that a set P C IRP 2 of size 
n lies on a union £\ U • ■ • U£ m of lines, and that at least en points of P 
lie on each of the lines £i. Suppose that m, | ^ nwooo. Then P spans 
at least e 6 n 2 /m 3 ordinary lines. 

Remark. The exponent could certainly be improved somewhat, 
but a really significant improvement - beyond j^, say - would require 
new methods. 

The proof of this proposition is quite long. Before embarking upon 



it we show how to derive Proposition 6.1 as a consequence. 
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Deduction of Proposition 6J_ from Proposition 6^3_ and Lemma 6J2 
Reorder the lines so that n\ ^ n2 ^ . • • ^ n m , where n« := \P H I 
Set Sj := nj/n. If £2 ^ l/3m then E\ ^ 2/3 and we are done by 



Lemma 6.2, so suppose that £2 ^ l/3m. Write Pj := Pfl (£\ U ■ • • 
j = 2,3, ... ,m. By Proposition |6.3[ the set Pj determines at least 



e 6 n 2 /m 5 ordinary lines (here we have used the trivial lower bound 
\Pj\ ^ I Pi I ^ n/m). Since \P \ Pj\ ^ m£j +1 n, the number of these 
which fail to be ordinary lines in P is bounded above by mEj + in 2 . If j 
is such that Sj + \ < (ej/m) G then it follows that the number of ordinary 
lines in P is at least \iejjrnfn 2 . Taking j ^ 2 minimal so that this 
inequality holds, it is clear that Sj ^ exp(— Ce m ), and so P spans 
3> exp(— Ce m )n 2 ordinary lines. If n ^ n (m,K) is sufficiently large 
this is greater than Kn, so we obtain a contradiction. □ 

Remark. We note that no(m, K) can be taken to have the shape 
no(m,K) ~ i^expexp (Cm). 



We may now focus our attention on establishing Proposition 6.3 We 
will divide into two quite different cases, according as the lines ii all 
intersect at a point or not. 

Proposition 6.4. Suppose that m ^ 2 and that P lies on a union 
l\ U • • • U £ m of lines, all of which pass through a point, and that at 
least en points of P lie on each of the lines t{. Then P spans at least 
e 2 n 2 /50 ordinary lines. 

Proof. We are greatly indebted to Luke Alexander Betts, a second 
year undergraduate at Trinity College, Cambridge, who showed us the 
following argument. For brevity we give a slightly crude version of the 
argument he showed us. 

Applying a projective transformation, we may assume without loss 
of generality that all the lines pass through the origin [0, 0, 1] in the 
affine part of MP 2 . Dualising, these m lines become points on the line 
at infinity, and the sets Pnlj become sets of parallel lines. If £ is a 
set of lines in IR 2 , we say that a point is ordinary for £ if it lies on 
precisely two of the lines in C. We say that £ is t-parallel if, for every 
line £ G £, there are at least t other lines parallel to it. Finally, we 
say that a point lying on three or more of the lines from £ is a triple 



point. The dual statement to Proposition 6.4 (with t replacing en) is 
then the following. 

Proposition. Suppose that £ is an t-parallel set of lines in M 2 , and 
that not all the lines of £ are parallel. Then there are at least t 2 /50 
ordinary points for £. 
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Figure 15. Figure relevant to the proof of Lemma 6.5 
Here, t = 6 and the ray £\ meets 4 > 6/2 lines in £. 
All of these points of intersection are ordinary as they lie 
outside the convex hull of the triple points of £, shaded 
in blue. 

The heart of the matter is the following lemma. 

Lemma 6.5. Suppose that £ is t-parallel, but not all the lines of £ are 
parallel. Then there is a line £ G £ containing at least t/2 ordinary 
points for £. 

Proof. If there are no triple points determined by £ then the conclu- 
sion is immediate, as every line intersects at least t+1 > t/2 other lines. 
If there are triple points determined by £, let T be the set of them. 
Let v be a vertex of the convex hull of T, lying on lines £i, £ 2 , £3 G £■ 
There are open rays (half-lines) £\ ,£3 emanating from v which do 
not intersect T. Suppose without loss of generality that the rays £f, £\ 
lie on either side of the line £2, as depicted in Figure 15 Each of the 
t other lines in £ parallel to £2 meets either £\ or , and so at least 
one of these rays meets at least t/2 lines in £. All of these points of 
intersection must, by construction, be ordinary points for £. □ 

Now let us return to the main problem, the dual form of Proposition 



6.4 stated above. Note that if £' is formed by removing at most t/5 



lines from £ then it is still 4t/5-parallel. Thus, by \t/5] applications of 



Lemma 6.5 we may inductively find distinct lines £\, . . . , £\t/S] such that 
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li contains at least 2t/5 ordinary points for C \ {l\, . . . ,li-i}. These 
are not necessarily ordinary points for C but any such point that is 
not lies on one of i\, . . . , Since it also lies on li, there are at most 
% — 1 < t/5 such points, and so li contains at least t/5 ordinary points 
of L. Each ordinary point of C lies on at most two of the lines li, so 
we get at least t 2 /50 ordinary points in total. □ 



We have now established Proposition 6.4 which is the particular case 



of Proposition 6.3 in which all the lines li pass through a single point. 



We turn now to the case in which this is not so. The next proposition, 



together with Proposition 6.4, immediately implies Proposition 6.3 and 



hence the main result of the section, Proposition 6.1 



Proposition 6.6. Suppose that m ^ 2 and that a set P C MP 2 of size 
n lies on a union l\ U • • • U l m of lines, not all of which pass through a 
single point, and that at least en points of P lie on each of the lines li. 
Suppose that m, - ^ riioooo. Then P spans 3> e 6 n 2 /m 3 ordinary lines. 

The proof proceeds via several lemmas. It also requires some additive- 
combinatorial ingredients not needed elsewhere in the paper, which we 
collect in the appendix. It is convenient, for this portion of the ar- 
gument, to work entirely in the affine plane. Let us begin, then, by 
supposing that a projective transformation has been applied so that all 
lines li and their intersections lie in the affine plane 1R 2 . 

Suppose that I is a line. Then by a ratio map on I we mean a map 

ip = <i[) m , :^1U {oo} 

of the form 

= length (pq) 
q ' q length(pg') ' 

where q, q' are distinct points on I and the lengths length(pg), length(pg') 
are signed lengths on I. We say that the ratio maps ifj q ^ and i/j q > iq are 
equivalent, but otherwise all ratio maps are deemed inequivalent. Note 
that such ratio maps implicitly appeared in the analysis of the infinite 



near-counterexample ( 2.2 ) . 

An ordered triple of lines li,lj,lk not intersecting in a single point 
defines two ratio maps 4>ij t k : ^ — >■ 1R U {oo} and 4>ij,k '■ lj ~ > K U {oo} 
via 

4>i,j,k '■= ipiiC\ij,tiC\i k and 4>i,j,k '■= ^n^/jn^- 

We will make considerable use of these maps in what follows, as well 
as of the following definition. 
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Definition 6.7 (Quotient set). Suppose that X C R U {00} zs a set. 
Then we write Q(X) for the set of all quotients X\jxi with X\,X2 G X 
and xi, X2 {0, 00}. 

Definition 6.8. Let A be a finite subset of some line i, and let ip be a 

ratio map on i. Then we say that A is a ip-grid if A is a union of at 
most nso sets S such that \Q(il>(S))\ ^ n l+ w . We say that A is a grid 
if it is a ip-grid for some ratio map ip on £. 

Lemma 6.9. Let £ be a line, and suppose thatip andip' are inequivalent 
ratio maps on £. Suppose that A is a ip-grid and that A' is a ip' -grid. 
Then \A fl A'\ < n 1- ^. 

Proof. Without loss of generality we may assume that i is the x- 
axis parametrised as {{t, 0) : t G R}, and so ip(t) — (t + a)/(t + b), 
f{t) = {t + a')/[t + V) with ajtb,a'^b' and {a, b} ^ {a', b'}. 

Suppose that A is a ip-grid and that A' is a ip'-grid. Write A = 
U£i 30 Si and A' = U"=i° S> where \Q(^(Si))\ and \Q(i/>'(S^))\ are both 

at most n 1+ To. It suffices to show that \S{ fl S'A -C n^s . 

Suppose, then, that X C S.nS'j. Then \Q(ip(X))\ and \Q{ip'{X))\ 

are both at most n 1+ w. Writing Y := log?/>(X), it follows that 

|y-y|,|/(y)-/(y)| (e.i) 

where /(?/) = logo?// o ip~ l o exp(y) = 

= log((6 - a')e y + a'-a)- log((6 - &V + b' - a) 
= log{Ae y + B)- \og{A'e y + B'), 

say, and /(V) := {f(y) '■ y G Y, f(y) is defined}. Note that we do not 
have AA' = and BB' = (there are four cases to consider). We thus 
compute 

_ e y (b - a)(b' - a')(AA'e 2y - BB') 
i ^ ~ [A e y + B) 2 {A' e y + B') 2 ' 

This is continuous except when e y = —B'/A' or —B/A, and nonzero 
except when e 2y = BB' /AA' . It follows that R may be split into at 
most 4 pieces on which / is defined and strictly concave/convex. By 



Theorem A. 8 this implies that at least one of \Y — Y\, \f(Y) — f(Y)\ 



has size 3> |y| 5 / 4 . Comparing with (6.1) we see that \X\ = \Y\ <C n^>. 



The result follows. □ 



Ratio maps may be used in understanding how lines intersect as a 
consequence of Menelaus's theorem, illustrated in Figure [6j 
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Figure 16. An illustration of Menelaus's theorem. The 
lengths are signed. 

Lemma 6.10. Let £i,£j,£ k be three lines not meeting at a point. Let 
Xi C and Xj C £j, and let T C X{ x Xj be a set of pairs, with neither 
Xi nor Xj containing £{ fl £j. Let X k C £ k be the set of points on £ k 
formed by intersecting the line {xi,Xj}, (x^Xj) G V, with £ k . Then 

\x k \ = \{4>i,jA x i)/4>i,jA x j) '■ { x u x i) e r}|. 

Proof. Apply Menelaus's theorem with AC = £i, AF = ij and 
CD = £ k . Suppose that x^ G £% and Xj G £j, and write E = x iy 
F = Xj in the diagram. Then D is the point at which {xi,Xj} in- 
tersects 4- Note that (j>i,j,k{xi) = EA/EC, 4>i,j,k{xj) = FA/FB. By 
Menelaus' theorem it follows that <pij^{xi) / ' <pi,j,k{xj) = DB/DC. This 
ratio uniquely determines the point D, and the lemma follows. □ 

Lemma 6.11. Suppose that P is a set of n points lying on a union 
£\ U • • • U £ m of lines, that the lines 4 are not all concurrent, and that 
at least en points of P lie on each line £%. Suppose that m, - ^ nioooo. 
Then either P spans at least e 2 n 2 /8 ordinary lines, or else there are at 
least two values of i such that PC\£i contains a grid with size 3> e 2 n/m. 

Proof. By the dual version of the Sylvester-Gallai theorem, there is 
some pair of lines £i,£j such that no other line passes through £i fl £j. 
Each of £i, £j contains at least en points of P, at least en — 1 ^ en/2 of 
which are not the intersection point £i(l£j. If P spans fewer than e 2 n 2 /8 
ordinary lines then there is some k such that for at least e 2 n 2 /8m pairs 
Pi G £i,Pj G £j (pi,Pj 7^ £i H £j) the line {pi,Pj} meets £ k in a point 
of P. Write Xi ■= (£ i nP)\ {U n£j), Xj := [£j D P) \ H £j) and 
rci,x Xj for the set of pairs (pi,Pj) G Xi x Xj for which {pi,Pj} 
meets £ k in a point of P. By Lemma |6.10| it follows that 

n ^ \X k \ ^ \{<j>ij,k(xi)/<jkijjk(xj) : (xi,Xj) G 
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By Corollary A. 2 and the hypothesis on m and | it follows that there 
are sets X- C X h Xj C Xj with |X-|, |Xj| > e 2 n/m and 

|e(0u,fcM)|, |e(#u t fc(A}))| « < n 1+ ^>. 

Thus certainly X] is a 0j Jjfc -grid and X'- is a ^jj^-grid. □ 



As a result of this lemma we may, in proving Proposition 6^3 , restrict 
attention to sets where Pn£i contains a large grid for at least two values 
of i. We study this situation further in the next lemma, whose proof 
is a little involved. 



Lemma 6.12. Suppose P is a set of n points lying on a union of 
lines t\ U • • • U £ m , where m ^ nioooo. Suppose that i ^ j and that 
Xi C P n li, Xj C P fl £j are grids, both of size at least e'n, where 
e' 3> n~Tooo ; and neither containing £i fl Taen either P contains 
3> (e') 3 n 2 ordinary lines, or else there is a line £k, not passing through 
£if]£j, and a grid X^ C PC\£k, such that all but at most e'\Xi\\Xj\/25 
pairs (xi,Xj) G Xj x Xj are swca £aa£ £ae line {xi,Xj} meets £k in & 
point of Xfe. 

Proof. Write rj := e'/lOO. Suppose that P does not contain at 
least (£') 3 n 2 /100 = r](e'n) 2 ordinary lines. Then there are at least 
(1 — n)|Xj||Xj| pairs (xj,Xj) G Xj x Xj such that {x^Xj} meets some 
other line t\~. Write r& C Xj x Xj for the set of pairs (xj, Xj) G Xj x Xj 

such that {xi,Xj} meets £k in a point of P. Thus \Tk\ ^ (1 — 
r/)|Xj||Xj|. We claim that there is at most one value of k for which 
^ r/|Xj||Xj|/m, and that for this k (if it exists) the line £k does 
not pass through £if]£j. Note that if \Tk\ ^ r]\Xi\\Xj\/m then certainly 
\T k \ ^ <5n 2 , where 5 = n -1 / 250 . 
This our claim is a consequence of the following two facts. 
Fact 1. If £ k passes through ^ fl £j then |r fc | < <5n 2 . 
Faci If we have two lines £k,£k', neither passing through £i fl £j, 
then at least one of |IV| has size at most 5n 2 . 

Proof of Fact 1. Suppose that Ik passes through £i fl £j, but that 
\Tk\ ^ Sn 2 . Here (and in the proof of Fact 2 below) we apply an affine 
transformation so that £i is the x-axis and £j is the y-axis. Suppose that 
£k is the line {(t, Xkt) : t G R}. Suppose that Xj = {(a, 0) : a G A} and 
Xj = {(0, 6) : b G -B} and, by slight abuse of notation, identify Tk with 
a subset of Ax B in the obvious way. A short computation confirms that 
the intersection of the line through (a, 0) G £{ and (0, b) G £j with ^ is 

there are sets A' <Z A 



the point { 1/x + Xk/y , i/ x +x k /y )- B Y Corollary 



A.2 



and B' C B with \B'\ > on and |^ - Jp\ < o" -11 ™ < n 1+ ^o. Since 
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Xi is a grid, there is some ratio function ip(t) = (t + a) / (t + (3) , a ^ (3 

such that A' C S i: where \Q(ip(Si))\ < n 1+ n> for each i. Let A' 

be the largest of the intersections A' D 5^; then 



10 



1 A" A" 

Writing Y := 1/A" and /(?/) := log(l + ay) — log(l + f3y) we thus have 
|y| > dn 1 -^, \Y - Y\, \f(Y) - f(Y)\ < (6.2) 

However 

/3 2 a 2 2a/3y - (a + /3) 



[l + f3y) 2 (l + ay) 2 v ^(l + m/) 2 (l + /32/) 2 

is continuous away from y = —1/a and y = —1//3 and has at just 
one real zero, and therefore one may divide K into at most 4 intervals 
on the interior of which / is defined and strictly convex/concave. By 
Theorem A. 8 it follows that one of \Y — Y\, \f(Y) — f(Y)\ has size 
3> |y| 5 / 4 . This comfortably contradicts (6.2) for large n. 

Proof of Fact 2. Suppose that |IV| ^ Sn 2 . By Lemma 6.10 and 
the fact that neither l k nor £ k i contains more than n points we have 

\{4>i,j,k{ x i)l<t>i,jA x i) '■ ( x h x j) e r fc)l < n 

and 

\{(j>ij jk '{ x i)/<i>i,j,k'{xj) ■ i x i, x j) e r fc /}| < n. 
Applying Corollary A. 2 exactly as before we deduce that there are sets 
X\ k \xf ] C Xi and sets xf\xf' ] C X jt all of size > 5n > n 1 "*, 
such that all of |Q(0 M - fc (xf >)|, |Q(0 i)i)fe ,(xf >))|, |Q(<^(xf } ))| and 
|Q(0 lJifc /(x] fe,) ))| have size < <T 12 n < n 1+ ^>. Thus x[ k) is a iijjfc - 

grid, X[ k ) is a <j> i}j]k ,-gnd, X {k) is a ^j^-grid and xf ) is a iii)fc /-grid. 
Since Xi and Xj are themselves grids, and since grids corresponding 
to inequivalent r atio functions intersect in a set of size no more than 



n 1_ 25 by Lemma 6.9, we see that (j)ij,k ~ 0ij,fe' an d 4>i,j,k ~ 4>i,j,k'- 

It follows immediately from the definition of 4>i > j > ki4 > i,3,k'i ( t > i,j,ki4 > i,o,k' 
that 4n4 = ty and ^ n £fc = £j H and therefore £fc = This 
is contrary to assumption, and so we have established Fact 2. 

This completes the proof of the claim. It follows that there is some k 
such that at least (1 — 2?7)|Xj||Xj| pairs X{ £ Xi,Xj £ Xj are such that 
{xi, Xj} meets P fl £ k , and furthermore £ k does not pass through ^ n 
If Xfc £ PC\£ k , we say that is well-covered if there are at least r](e') 2 n 
pairs (xj,Xj) £ Xj x Xj such that {a^, passes through x k . Since 
l-P H £k\ ^ n, the number of pairs (x i; Xj) used up by poorly-covered x k 
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is no more than r)(e') n ^rj\Xi\\Xj\. Thus, for at least (1 — 3^)1 Xj||X. 



j i 



pairs (xi, Xj), the line {xi, Xj} meets £ k in a well-covered point x k . Write 
X k C P n tk for the set of well-covered points; we are going to show 
that there is a grid X k occupying almost all of X k . 

Note that, by definition of well-covered, for any Y C X k there is 
a graph r C Y x Xj, |T| ^ r7(V) 2 |F| |Xj|, such that each line {y, Xi} 



meets Xj whenever (y, Xi) G T. By Lemma 6.10 and the trivial bound 
\Xj\ ^ n, this implies that 

S uppo se that \Y\ is r/n. Then |T| ^> n 2 {e') i n 2 ^> n 2 ~2k. By Corol- 



lary [A2| there are sets and X- C Xj, |X-| > n 1- ™, such 

that iQC^ijjfe^'))! < ^ 1+ «>. Applying this argument repeatedly, we 
see that all of X k except for a set of size at most nn can be covered by 
disjoint sets Y' with these properties. There are at most Cnwo < 
of these sets, and so the union of them, X k say, is a ^^-grid. 



Finally, note that the number of pairs (xi,Xj) for which {xi,Xj} 
passes through X k \X k is at most n\X k \X k \ ^ nn. For all other pairs, 
{xi,Xj} passes through the grid X k . 

At last, this completes the proof of the lemma. □ 



We are now in a position to complete the proof of Theorem 6J3 and 
hence of all the other results in this section. 



Proof of Theorem \6.6] Suppose that P lies t\ U • • • U£ m , with at least 
en points on each line and not all of th e line s through a single point. 
Suppose that m, - ^ nioooo. By Lemma 



6.11 



we are done unless there 
are two values i,j G {1, . . . , m} such that P n it, P PI £j both contain 
grids of size at least e'n, with e' e 2 /m > n~iooo. Suppose without 
loss of generality that £i contains the largest grid amongst all grids in 
P; call this Xj. Let Xj be a grid in PC\£j of size at least e'n, and apply 
Lemma |6.12| to these two grids Xj,Xj. If the first conclusion of that 



lemma holds then P contains ^> (e') 3 n 2 ^> e 6 n 2 /m 3 ordinary lines, and 
we are done. Otherwise there is a line £ k , not passing through ^ n £j, 
and a grid X k C P fi £ k , such that for all Xj) in a set T of size at 
least (1 — 4?7)|Xj||Xj|, the line {xi,Xj} meets £ k in a point of X k . 
By Lemma [6.10| we have 

\{<l>i,j,k{xi)/4>i,j,h{xj) ■■ (xi,xj) g r}| < \x k \. 
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It follows from Lemma IA.3I that 

\X k \ > \X i \ + \X J \-2-^2r 1 \X l \\X J \ > \XA, 

contrary to the assumption that Xj was a grid in P of largest size. □ 

In this rather long section, we have proved results about sets P, con- 
tained in a union l\ U • • • U £ m of lines, spanning few ordinary lines. 
They are plausibly of independent interest. Let us now, however, re- 
turn to our main task and combine what we have established with the 



main results of previous sections. By combining Proposition 6.1 with 



Proposition 5.3 we obtain the following structural result. From the 



qualitative point of view at least, this supersedes all previous results in 



the paper (and, in particular, implies Proposition 1.4 as a corollary) 



Proposition 6.13. Suppose that P is a set of n points in RP 2 for 
some n ^ 100 spanning at most Kn ordinary lines, where 1 ^ K ^ 
c(log log n) c for some sufficiently small absolute constant c. Then P 
differs in at most 0(K°^) points from a subset of a set of one of the 
following three types: 

(i) A line; 

(ii) The union of an irreducible conic and a line; 
(in) An irreducible cubic curve. 



Proof. We apply Proposition 5.3 In case (i) we are already done. 



In cases (ii) and (iii) we see that, after removing an irreducible conic 
if necessary, that we have ^ n/2 — 0(K°^) points on Q(K° ^) l ines 
determining at most 0(K° ( - 1 ^)n ordinary lines. By Proposition 



6.1 



and 

the hypothesis K ^ (loglogn) c , all but 0(K°^) of these points lie on 
a single line, and the claim follows. □ 



7. The detailed structure theorem 
We turn now to the proof of the detailed structure theorem, Theo- 



rem 



1.5 We first establish a slightly weaker version in which the linear 



bounds O(K) have been relaxed to polynomial bounds 0(K°^). This 
somewhat weaker statement is already sufficient for our main applica- 
tion, the proof of the Dirac-Motzkin conjecture for large n. At the end 
of this section we indicate how to recover the full strength of Theorem 
PI 



Theorem 7.1. Suppose that P is a finite set of n points in the pro- 
jective plane MP 2 . Suppose that P spans at most Kn ordinary lines 
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for some K ^ 1, and suppose that n ^ expexp(CK c ) for some suf- 
ficiently large absolute constant C . Then, after applying a projective 
transformation if necessary, P differs by at most 0(K ^') points from 
an example of one of the following three types: 

(i) n — 0{K°W) points on a line; 



ii) The set X2 m defined in (1.1) for some m = |n — 0(K ^' 



(iii) A coset H © x, 3x G H, of some finite subgroup H of the real 
points on an irreducible cubic curve with H having cardinality 
n + 0(K°V>). 



We now prove this theorem. We already know from Proposition 6.13 
that a set with at most Kn ordinary lines must mostly lie on a line, 
the union of a conic and a line, or an irreducible cubic curve, and the 
proof of Theorem L5] proceeds by analysing the second two possibilities 
further. 

The key to doing this is the fact that collinearities on (possibly re- 
ducible) cubic curves are related to group structure. This is particularly 
clear in the case of an irreducible cubic, as we briefly discussed in Sec- 
tion [2] . However one can see some group structure even in somewhat 
degenerate cases. 

An important ingredient in our analysis is the following result of 
a fairly standard type from additive combinatorics, which may be 
thought of as a kind of structure theorem for triples of sets A, B, C 
with few "arithmetic ordinary lines" . 



Proposition A.4[ Suppose that A, B, C are three subsets of some 



abelian group G, all of cardinality within K of n. Suppose that K 
o(n). Suppose that there at most Kn pairs (a,b) G A x B for which 
a + b C . Then there is a subgroup H ^ G and cosets x + H,y + H 
such that \AA{x + H)\, \BA(y + H)\, \CA(x + y + H)\^7K. 

This result will not be at all surprising to the those initiated in 
additive combinatorics, but we do not know of a convenient reference 
for it. We supply a complete proof in Appendix |A~1 

Suppose that P is mostly contained in a cubic curve 7 which is not a 
line. We subdivide into two cases according to whether 7 is irreducible 
or not. 

Lemma 7.2 (Configurations mostly on an irreducible cubic). Suppose 
that P is a set of n points in MP 2 spanning at most Kn ordinary lines. 
Suppose that all but K of the points of P lie on an irreducible cubic 
curve 7, and suppose that n ^ CK for a suitably large absolute constant 
C . Then there is a coset H © x of 7 with 3x = x(Bx(Bx&H such 
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that \PA(H (B x)\ = 0(K). In particular, 7 is either an elliptic curve 
or an acnodal cubic. 

Proof. Let 7* be the smooth points of 7, which we give a group law 
as in Section [2] 

Set P' := Pn 7*. Then \P'\ = \P\ + O(K), and P' spans at most 
0(Kn) ordinary lines. If p±, P2 G 7* are distinct then the line joining p\ 
and P2 meets 7* again in the unique point Qpi Q P2- This assumption 
implies that Qpi Qp2 G P' for all but at most 0(Kn) pairs pi,P2 G P'. 
Applying Lemma A.4[ it follows that there is a coset H © x of 7* such 



that \PA(H®x)\ = O(K) and also \PA(HQ2x) \ = 0(K). From this 
it follows that \(H ®x)A(H Q2x) \ = O(K), which implies that 3x G H 
if n ^ CK is large enough. 

If 7 is not an elliptic cuvre or an acnodal cubic then the group 7* 
is isomorphic to R or to R x Z/2Z, and neither of these groups has a 
finite subgroup of size larger than 2. □ 

We turn now to the consideration of sets P which are almost con- 
tained in the union of an irreducible conic a and a line £. The union o~U£ 
is a reducible cubic and so does not have a bona fide group law. There 
is, however, a very good substitute for one as the following proposition 
shows. In what follows we write a* :— a \ (cr H £) and £* :— £ \ (a H £). 
Note that the intersection a D £ has size 0, 1 or 2. 

Proposition 7.3 (Quasi-group law). Suppose that a is an irreducible 
conic and that £ is a line. Then there is an abelian group G = G a ^ 
with operation © and bijective maps tp a : G — > a*, tpe : G — > £* such 
that ip a (x),ip a (y) and ipi(z) are collinear precisely if x © y © z = 0. 
Furthermore G a ^ is isomorphic to Z/2Z x R if \a D £\ =2, to R if 
\a n £\ = 1 and to R/Z if \a C\£\ = 0. 

Proof. This is certainly a known result, but it is also an easy and fun 
exercise to work through by hand, as we now sketch. If \a H £\ = 2, 
we may apply a projective transformation to move the two points of 
intersection to [0,0,1] and [0,1,0], and a* to the parabola {[a,a 2 ,l] : 
a G R x } and t to {[0,-6,1] : b G R x }. We note that [ai,af,l], 
[a 2 , a|, 1] and [0, b, 1] are collinear if and only if b = aia 2 - 

Consider the maps ipe : Z/2ZxR — > £* defined by ipe(e, x) = [0, —6, 1] 
where b = (— l) e e~ x and ip a : Z/2Z x R — > a* defined by ip (7 (e,x) = 
[a, a 2 ,l] where a = (— l) £ e x . Then we see that the maps ipi,ip a are 
bijections and that the claimed collinearity property holds. 

Now suppose that we are in case (ii), that is to say \a fl £\ = 1. 
Applying a projective transformation, we may suppose that the point 
of intersection is [0, 1, 0] and move a* to the parabola {[a, a 2 , 1] : a G R} 
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and t to the line at infinity {[1, — b, 0] : b G R}. Now note that [a±, of, 1], 
[a2, a|, 1] and [1, —6, 0] are distinct points in P' that are collinear, then 



a\ + ci2 + & = (cf. the near-example (2.4)). 

Finally suppose that cr and £ do not intersect. Applying a projective 
transformation we may map t to the line at infinity { [sin 7r8, cos tt6, 0] : 
G R/Z}. As cr is disjoint from £ it must be a compact conic section 
in R 2 , that is to say an ellipse. By a further affine transformation we 
may assume that it is in fact the unit circle {[cos27r#, sin27r#, 1] : 6 G 
R/Z}. By elementary trigonometry it may be verified that the points 
[cos27rai, sin27rai, 1], [cos27ra2, sin27ra;2, 1] and [sin7r/3, cos7r/3, 0] are 
collinear if and only if a\ + ct2 + (3 = 0, thus in this case the result is 
true with ip a {9) = [cos27r#, sin27r#, 1] and ipe(6) = [sin tt6, cos tt6, 0]. □ 

We may now derive the following consequence, analogously to Lemma 

Lemma 7.4 (Conic and line). Suppose that P C MP 2 zs a se£ of n ^ 

no(K) points, all except K of which lie on a the union of an irreducible 
conic a and a line i. Suppose that P defines at most Kn ordinary 
lines, and suppose that P has n/2 + O(K) points on each of a and £. 
Then, after a projective transformation, P differs from one of the sets 
X n i by at most 0(K) points. 

Proof. Set P a := P n cr* and P e := P n t. Then \P a \ + \P e \ = 
\P\ + O(K), and P' spans at most O(Kn) ordinary lines. Consider 
also the pull-backs A := ip^^Pi) and B := ■?/;~ 1 (P (J ), where ipi,ipa are 
the "quasi group law" maps introduced in the preceding proposition. 
Both A and B are subsets of G a £, a group for which there are three 



possibilities, detailed in Proposition 7.3 



The assumption about ordinary lines implies that Qa\ 02 G B for 
all but at most O(Kn) pairs ai,02 G B. Applying Lemma A. 4 it 
follows that there is a subgroup H ^ G 7 and cosets x ® H, —2x © 
such that \AA(x © \BA(-2x ® H)\ = 0{K). 

If n ^ CK for large enough C then it follows that \a D ^| =0 since 
(with reference to the three possibiities for G a £ described in Proposition 



7.3) neither Z/2Z x R nor R has a subgroup of size larger than 2. 



Applying a projective transformation, we may assume that t is the line 



at infinity and, as in the proof of Proposition 7^ that cr is the unit 
circle. We may apply a further rotation so that x = 0, that is to say 
\AAH\ = 0{K) and \BAH\ = 0(K). 

All finite subgroups of R/Z are cyclic and so we have H = {j/m : 
j G {0, 1, ... ,m — 1}} for some m, that is to say H consists of the 
(additive) m th roots of unity. But then ip a (H) U ipe(H) is precisely the 
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set X n i = X 2m described in the introduction and in the statement of 
Theorem 11.51 □ 



Putting Lemmas 7.2 and 7.4 together with the main result of the 



previous section, Proposition |6.13 we immediately obtain Theorem 7A_ 



The remainder of this section is devoted to establishing our most precise 
structure theorem, Th eore m |1.5| Let us remind the reader that this is 
the same as Theorem 7.1, only the polynomial error terms 0{K°^) 



are replaced by linear errors O(K). The reader interested in the proof 
of the Dirac-Motzkin conjecture for large n may proceed immediately 



to the next section, where Theorem 7.1 is already sufficient. 



Proof of Theorem L5_, The converse claim to this theorem already 
follows from the analysis in Section [2j so we focus on the forward claim. 
We may assume that the constant C is sufficiently large. We then apply 



Theorem 7.1 to obtain (after a projective transformation) that P differs 
by 0{K°^) points from one of the three examples (i), (ii), (iii) listed. 
Our task is to bootstrap this 0(K°^) error to a linear error O(K). 

Suppose first that case (i) holds, thus all except 0(K°^) points of 
P lie on a line i. Then every point p in P that does not lie on I forms 
at least n - 0{K 0{ $) lines with a point in P n t. At most 0(K°^) of 
these can meet a further point in P, so each point in P\£ produces at 
least n — 0(K°^) ordinary lines connecting that point with a point 
in P n t. We conclude that the number of ordinary lines is at least 
(n — 0(K°^))\P\£\; since there are at most Kn ordinary lines, we 
conclude that \P\£\ = O(K), and the claim follows. 

Now suppose that case (ii) holds, thus P differs by 0(K°^) points 
from for some m = |n — 0{K°^). To analyse this we need the 
following result, essentially due to Poonen and Rubinstein 



Proposition 7.5. Let U n C C = 1R 2 denote the regular n-gon consist- 
ing of the n th roots of unity. Then no point other than the origin or 
an element ofU n lies on more than C lines joining pairs of vertices in 
Il n , for some absolute constant C . 



Actually, in [30] it was shown that C could be taken to be 7 when 
one restricts attention to points inside the unit circle. The case of 
points outside the unit circle was not directly treated in that paper, 
but can be handled by a variant of the methods of that paper. See 
Appendix |B] for details. For the purposes of establishing Theorem L5 



the full strength of Proposition 7.5 is unnecessary. Indeed, the more 



elementary weaker version established in Proposition B.2 would also 
suffice for this purpose. 
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Corollary 7.6. Suppose that p G MP 2 does not lie on the line at in- 
finity, is not an m th root of unity and is not the origin [0,0, 1]. Then 
at least 2m — 0(1) of the 2m lines joining p to a point of X2 m pass 
through no other point of X2 m - 



Proof. If x G X 2 m then we say that x is bad if the line {p, x} passes 
through another point y G X 2m , y ^ x. Suppose first that x is an m th 
root of unity. We claim that if x is bad then px passes through another 
m th root of unity, different from x, or else {p, x} is tangent to the unit 
circle. If y is already an m th root of unity then we are done; otherwise 
y is one of the m points on the line at infinity. But then the line {x, y} 
passes through another m th root of unity x' unless it is tangent to the 
unit circle, and we have proved the claim. 

This enables us to count the number of bad x which are m th roots of 
unity. There are at most two coming from the possibility that {p, x} is 
tangent to the unit circle. Otherwise, {p, x} contains another m th root 
of unity x', whence p lies on the line {x,x'}. This gives at most 0(1) 



further possibilities by Proposition 7.5 



Now suppose that x is one of the m points on the line at infinity. 
If {p, x} passes through an m th root of unity y then it passes through 
another such root of unity y' unless {p, x} is tangent to the unit circle. 
There are at most 2 points on the line at infinity corresponding to 
the tangent lines, and then at most 0(1) corresponding to t he ch ords 



{y, y'} on which p lies, by another application of Proposition |7.5[ □ 
We return now to the analysis of case (ii). Let p be a point of P not 



on either the unit circle or the line at infinity. Then by Corollary 7.6 
only 0(K°^) of the n — 0(K°^>) lines connecting p with X m , also 
meet another element of X m . As P only differs from X m by 0(K°^) 
points, we conclude that there are n — 0(K°^) ordinary lines of P 
that connect p with an element of X m . As in case (i), this implies 
that there are at most O(K) points of P lying outside the union of the 



unit circle and the line at infinity. Applying Lemma 7.4, we obtain the 
claim. 



Finally, we consider the case (iii). By Lemma 7.2, it suffices to show 
that there are at most O(K) points of P that do not lie on the curve 
E, which is either an elliptic curve or the smooth points of an acnodal 
singular cubic curve. By the same argument used to handle cases (i) 
and (ii), it then suffices to show that each point p in P\E generates 
^> n ordinary lines in P. 

For this, it suffices to establish the following lemma. 
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Lemma 7.7. Suppose that E is an elliptic curve or the smooth points 
of an acnodal singular cubic curve and that H @x is a coset of a finite 
subgroup of E of size n > 10 4 . Then, if p ^ E is a point, then there are 
at least n/100 lines through p that meet exactly one element of H © x. 

Remark. The constant 1/100 could be improved a little by our meth- 
ods, but we have not bothered to perform such an optimisation here. 
It seems reasonable to conjecture, in analogy with the results in |30j . 
that in fact there are only 0(1) lines through p that can meet three 
elements of a coset x © H of a finite subgroup of an elliptic curve, but 
this would seem to lie far deeper. 

Proof. We first exclude one degenerate case, in which E is the smooth 
points of an acnodal singular cubic curve, and p is the isolated (i.e. 
acnodal) singular point of that curve. In this case, any line through p 
meets exactly one point of E, and the claim is trivial. Thus we may 
assume that p does not lie on the cubic curve that contains E. 

Note^ that there are at most 6 lines through p which are tangent 
to E, and that any other line through p that meets two elements of 
H © x will meet a third element of H © x. Thus, it suffices to show 
that there are ^ O.Oln > 0.005n + 6 lines through p that meet at least 
one element of H © x, but do not meet three such elements. 

Suppose this were not the case. Then at least 0.99n elements q G 
H © x will have the property that the line {p, q} meets two other 
elements of H ffi x. 

As a topological group, the cubic curve E is isomorphic to either R/Z 
or R/Z x (Z/2Z). Consider all the lines through p that are tangent to 
E; there are at most 6 such lines, and the points of tangency partition E 
into at most 7 connected open sets A\, . . . , A7 (topologically, these are 
either arcs or closed loops), plus 6 endpoint vertices. From a continuity 
argument, we see that for each i one of the following statements is true: 

(i) the lines connecting p to points of Ai do not meet E again; 

(ii) there exist Aj,A^, distinct from each other and from Ai, such 
that any line connecting p and a point in A4 meets E again, 
once at a point in Aj, and once at a point in A^. 

Suppose that i is of type (i). Then we may assume that \Ai D (H ffi 
x)\ < 0.005n, since all the lines from p to Ai R (ffffix) contain no other 
point of E. By the pigeonhole principle there is some % of type (ii) with 
\Ai fl (H © x)\ > h(l — 0.035)n > O.ln. By property (ii), lines from p 
through Ai meet the curve E again in Aj and A^. 



The points of tangency must all lie on the intersection of E with its dual curve 
with respect to p, which has degree 2; see also the Pliicker Formula. 
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We are assuming that for all except at most O.Oln elements q of 
Ai n (if © x), the line {p, q} meets Aj and Ak at elements of if © x. It 
is easy to conclude from this, and similar statements for j, k, that the 
sizes of Ai n (if © x), Aj D (if © x) and A fc n (if © x) differ by at most 
0.02n. 

Let 4>ij : Ai — >■ be the map that sends a point g in Aj to the point 
{p, q}C)Aj, then 0^ is a homeomorphism from the set Ai to the set Aj; in 
particular, 0^- is either orientation-preserving or orientation-reversing, 
once one places an orientation on both Ai and Aj. Furthermore, <pij 
maps all but at most O.Oln of the elements of A,n(ff ffix) to AjC\{H@x) 
and vice versa. Now as if is a subgroup of E, which as an abelian 
topological group is either R/Z or R/Z x (Z/2Z), we see that the sets 
Aj fl (if © x), Aj fl (if © x) are arithmetic progressions in that group 
with a common spacing ft. 

Now for all but at most O.Oln values of y G Ai fl (H © x), <pij maps 
y to a point of Aj fl (if © x). For all but at most 1 + O.Oln values of 
y G Ai fl (if © x), 0y maps y © /i to a point of fl (if © x). Of the 
values of values of y satisfying both of these statements, for all but at 
most 0.02n values we have 

<pij{y ® h) = ^{y) ® h' (7.1) 

for h' equal to either h or Qh (depending on whether <f>^ is orientation- 



preserving or orientation-reversing). Thus (7.1) holds for all except at 
most 0.05n values of y G Ai fl (if © x). 

Similarly, defining <pik in exactly the same way as 0y, we see that for 
all except at most 0.05n elements y in Ai n (if © x) we have 

My®h) = (f)i k (y) @h" (7.2) 

for h" equal to either h or Qh. 

Thus we may find y G (x © if ) D A\ such that both (|7T| and ( J7T2| ) 
hold. On the other hand, as y,<pij(y),<pik(y) are collinear, we have 

y®Mv)®Mv) = o 

and similarly 

y © h © ^-(y © h) © ife (y © /i) = O. 
From these equations and ( 7.1[ ), ( 7.2[ ) we conclude that 



h®h'®h" = O. 

Since ft/, ft" are equal to either ft or ©ft, we conclude that ft has order at 
most 3, and so \A { fl (ff © x)| ^3. However we have already observed 
that \Ai n (if © x)| ^ O.ln > 100, a contradiction. □ 
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The proof of Theorem 1.5 is now complete. 



8. The Dirac-Motzkin conjecture 

The Dirac-Motzkin conjecture is the statement that, for n large, a 
set of P points in M 2 not all lying on a line spans at least n/2 ordinary 
lines. The main result of this paper is a proof of a more precise version 



of this for large n, Theorem 2.2, together with a characterization of the 



extremal examples. We prove the even more precise Theorem |2.4[ of 
which Theorem |2.2| is an easy consequence, in this section. We refer 
the reader to Section [2] for a precise statement of these two results and 
a leisurely discussion of the relevant examples. 

Suppose that P spans at most n ordinary lines and that P is not 



collinear. We may apply our main structure theorem, Theorem 1.5 
to conclude that P differs in 0(1) points from one of three examples: 
points on a line, a set X n >, and a coset of a subgroup of an irreducible 
cubic curve (Sylvester-type example). In fact the weaker and rather 



easier Theorem 7^ suffices for this purpose. 

It is obvious that the first type of set spans at least n — 0(1) ordinary 
lines. Sets close to a Sylvester example are also relatively easy to 
handle. 

Lemma 8.1. Suppose that P C MP 2 differs in K points from a coset 
H © x of a subgroup H of some irreducible cubic curve, where 3x = 
X © X © X G H . Then P spans at least n — 0(K) ordinary lines. 

Proof. Write ho := 3x thus ho G H. For every h G H, the line 
joining h © x and (—2h ho) © a; is tangent to 7 at h © x, since 
(h ffi x) © (h © x) © (— 2h Qh Q)x) = 0. Therefore it is an ordinary line 
unless 3h © h = 0, in which case the points h © x and — 2h h © x 
coincide. Thus the only points of H © x not belonging to an ordinary 
line spanned by if ©a; correspond to the points of H with 3h(Bh = 0. 
Since H is isomorphic to a subgroup of either R/Z or Z/2Z x R/Z, 
there are no more than 3 of these. It follows immediately that any 
set formed by removing at most K points of P' has at least n — O(K) 
ordinary lines, and these are all tangent lines to 7. No point outside 
of 7 lies on more than 3 tangent lines to 7, and so the addition of 
such a point destroys no more than 3 of our n — O(K) ordinary lines. 
It follows that P itself spans at least n — 0(K) ordinary lines, as we 
wanted to prove. □ 

Combining this lemma with the remarks just preceding it, we have 
now established the existence of an absolute constant C such that a 
set of n points, not all on a line, and spanning at most n — C ordinary 
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lines, differs in 0(1) points from a set Xi m consisting of the m th roots 
of unity plus m corresponding points on the line at infinity. Now the 
m tangents to the unit circle at roots of unity pass through only one 
other point of X 2m , and so X 2m has m ordinary lines. Furthermore it 
is clear that the addition/deletion of 0(1) points does not affect more 
than 0(1) of these lines. This already establishes a weak version of the 
Dirac-Motzkin conjecture: every non-collinear set of n points spans at 
least n/2 — 0(1) ordinary lines. 

To prove Theorem |2.4 a much more precise result, we must analyse 



configurations close to X 2m more carefully. What is needed is precisely 
the following result which, together with what we have already said in 



this section, completes the proof of Theorem |2.4| Recall from Section 
[2] the examples of Boroczky. 

Proposition 8.2. There is an absolute constant C such that the follow- 
ing is true. Suppose that P differs from X 2m in at most K points, and 
that P spans at most 4m — CK ordinary lines. Then P is a Boroczky 
example or a near-Bdrdczky example. 

We now prove this proposition. Suppose that P differs from X2 m in 
at most K points. Suppose first of all that P contains a point p outside 
X 2m , that p does not lie on the line at infinity and that p ^ [0,0, 1]. 



Then by Corollary 7J3 the 2m tangent lines at the unit circle, as well as 
at least 2m — 0(1) of the lines {p, x} connecting p to a point x e X 2m , 
pass through precisely 2 points of X 2m U {p}. It is clear that the 
addition/deletion of K points other thanp cannot add or delete points 
on more than O(K) of these lines, and so P spans 4m — O(K) ordinary 
lines. 

Now suppose that P contains an additional point p on the line at 
infinity. Then the 2m tangent lines to the m th roots of unity, as well as 
the at least 2m — 2 lines {p, x}, x an m th root of unity, which are not 
tangent to the unit circle contain precisely two points of X 2m U {p}. 
Once again the addition/deletion of K points other than p cannot add 
or delete points on more than O(K) of these lines, and so again P 
spans 4m — O(K) ordinary lines. 

We have now reduced matters to the case P C X 2m U [0, 0, 1] . Starting 
from X 2m , the omission of a point or the addition of [0, 0, 1] creates a 
certain number of new lines with precisely two points, and of course 
no point other than [0,0, 1] or the omitted point is on more than one 
of these new lines. By inspection in any case there are always at least 
m — 0(1) of these lines, and so there are at least 4m — O(K) ordinary 
lines unless we do at most one of the operations of adding [0, 0, 1] 
or removing a point of X 2m . At this point a short inspection of the 
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possibilities leads to the conclusion that the Boroczky examples and 
the near-Boroczky examples are the only ones which do not have at 
least 4m — 0(1) ordinary lines. This, at last, concludes the proof of 
Theorem 12.41 □ 



Remark. We relied on Corollary 7.6 which depended on the result 
of Poonen and Rubinstein [30] . For the purposes of proving the Dirac- 
Motzkin conjecture for large n, the somewhat easier Proposition B.2 is 
sufficient. 



9. The orchard problem 



In this section we establish Theorem |1.3[ the statement that a set 
of n points in the plane contains no more than \_\n{n — 3) J + 1 3-rich 
lines when n is sufficiently large. The sharpness of this bound was 
established in Proposition 2.6| 



If iVfc is the number of lines containing precisely k points of P then, 
by double-counting pairs of points in P, we have 



E 



fK=(;Y (9.i! 



k^2 

From this it follows that if N 3 > \_\n{n — 3) J + 1 then 



E 

k>2 



N k < n, (9.2) 



from which we conclude that N2, the number of ordinary lines spanned 
by P, is at most n. Furthermore no line contains more than 0{\/n) 
points. 



We may now apply Theorem 1.5, our structure theorem for sets with 



few ordinary lines. Since no line contains more than 0(y/n) points of P 
we see that in fact only option (iii) of that theorem can occur, that is to 
say P differs in 0(1) points from a coset H © x, 3x G H, of a subgroup 
H of some irreducible cubic curve 7, which is either an elliptic curve 
or (the smooth points of) an acnodal singular curve. The rest of the 
analysis is straightforward but a little tedious. 



Suppose that 3x = Hq. As in the proof of Lemma 8.1 the tangent 
line to 7 at h © x meets H © x in the point (— 2h ho) © x, which is 
distinct from the first point unless 3h + h = 0. There are at most 0(1) 
of these. In creating P from H © x by the addition/deletion of 0(1) 
points, at most 0(1) of these lines are affected. 
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Since P spans at most n ordinary lines, it follows that P contains 
only 0(1) ordinary lines other than these tangent lines. Furthermore, 
since we now know that P contains at least n + 0(1) ordinary lines, 
that is to say N 2 = n + 0(1), we conclude from (9.2) that iV 4 = 0(1). 
We are going to conclude that P = H ©x, a statement whose proof we 
divide into three parts. 

Claim 1. There is no point of P off the curve 7. If p is such a point, 
all except 0(1) of the lines joining p to points of P fl 7 must contain 
precisely two points of P D 7, or else there would be too many lines 
containing p with 2 or 4 points of P. Note that this cannot happen 
if 7 is the smooth points of an acnodal singular cubic curve and p is 
the isolated singular point, since every line through p meets at most 
one point of 7; thus p lies outside of the cubic curve containing 7. 
Consider the lines t from p to P fl 7 which are not tangent to 7 and 
which contain precisely two points of P fl 7 and precisely two points 
of H © x, these points being the same. Since P fl 7 differs from H © x 
in 0(1) points, all except 0(1) of the lines from p have this property. 
But any line not tangent to 7 and containing the two points h © x and 
hi © x also contains —{h®h'@ h ) © x, a third point of H © x. This 
is a contradiction. 

Claim 2. There is no point of P outside the set H®x. Suppose that 
k © x is such a point. Then if h G H, the line joining k © x and h © x 
meets 7 again at —(k(Bh(Bh ) ©x, which is not a point of H@x. This 
point can thus only lie in P for 0(1) values of h, and hence there are 
n — 0(1) ordinary lines of P emanating from k © x. In addition to the 
n — 0(1) tangent lines, this gives at least 2n — 0(1) ordinary lines in 
P, a contradiction. 

Claim 3. P contains all of H ffi x. Suppose that h* © x is a point of 
H (B x not contained in P. For all except 0(1) values of h, the points 
h@x and — (h®h*®ho)®x lie in P, and the line joining h*®xto them 
is not tangent to 7. All such lines then contain precisely two points 
of P, and once again we obtain n — 0(1) ordinary lines to add to the 
n — 0(1) tangent lines we already have. Once again a contradiction 
ensues. 

We have now shown that if P is a set of n points in the plane with 
AT 3 , the number of lines in P spanning precisely 3 points, satisfying 
A^ 3 > \_\n{n — 3)J + 1, then P is a coset H © x on an elliptic curve 
7, with 3x G H. But by Proposition 2.6 (and in the ensuing) we have 
A^3 ^ L| n ( n — 3)J + 1 in any such case, and we are done. 
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Remark. In fact nothing like the full force of Theorem L5 is required 
for this application. Once the much weaker Proposition |5.3| is estab- 
lished, we can immediately rule out possibilities (ii) and (hi) of that 
proposition and hence do away with all of the material in Section [6] and 
some of the material in Section [7] too. 

Appendix A. Some tools from additive combinatorics 

In this section we collect some more-or-less standard tools from ad- 
ditive combinatorics used in Sections [6] and 

If A, B are two sets in some abelian group, and if V C A x B is a 
set of pairs, we write A + r B := {a + b : (a, b) G T}. The next result is 
known as the Balog-Szemeredi-Gowers theorem. The precise form we 
use is a variant of Gowers's version [191 Proposition 12] due to Sudakov, 
Szemeredi and Vu [331 Theorem 4.1]. 

Theorem A.l (Balog-Szemeredi-Gowers). Suppose that A, B are two 
sets in an abelian group, both of size at mostn. Suppose that T C AxB 
is a set (which may be thought of as a bipartitie graph) with \T\ ^ n 2 /K. 
Suppose that \A +r B\ ^ K'n. Then there are sets A 1 C A, B' C B 
with \A'\ ^ n/AK, \B'\ ^ n/lQK 2 such that \A' + B'\ < 2 12 (K') 3 K 5 n. 

Proof. See [33, Theorem 4.1]. In the statement of that result A and 
B are both supposed to have size n, but it easy to see that the proof 
works under the assumption that they both have size at most n. □ 

On several occasions in Section[6]we will apply the preceding theorem 
together with Ruzsa's triangle inequality, which states that \U\\V — 
W\ ^ \U — V\\U — W\ for any sets U,V,W in an abelian group (in 
fact, the group does not even need to be abelian). Let us record, as a 
corollary, the result of doing this in the particular context we need. 

Corollary A. 2. Suppose that A,B are two sets in an abelian group, 
both of size at most n. Suppose that V C A x B is a set with \Y\ ^ 5n 2 
for which \A +r B\ ^ n. Then there are sets A' C A, B' C B with 
\A'\, \B'\ > 5 2 n such that \A' - A'\, \B' - B'\ < <J- U n. 

Proof. This follows immediately from the preceding lemma and the 
Ruzsa triangle inequality. □ 

The following is a "robust" version of the elementary sumset estimate 

\u + v\ > \u\ + \V\ - I. 

Lemma A. 3. Let U, V C K be sets of size r and s respectively. Suppose 
that r C U x V has cardinality at least (1 — 5)rs. Then \U +r V\ ^ 
r + s - 2 - 2y/26rs. 
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Proof. Suppose that U = {ui, . . . ,u r } with u% < • • • < u r , and 
V = {vi, . . . , v s } with vi < ■ ■ ■ < v s . For any 1 ^ k ^ min(r, s) we 
have 

ui + v k < u 2 + v k < ■ ■ ■ < u r - k + v k < u r -k + v k +i < ■■■ < U r - k + v s , 

giving r + s — 2k distinct elements of U + V. As k varies, no pair 
(itj, Vj) appears in this listing more than twice. Thus by the pigeonhole 
principle there is, for any choice of positive integer ko, some k ^ ko such 
that at most 25rs/k elements of this listing come from pairs (ui,Vj) 
not lying in T. It follows that 

\U+ r V\ ^r + s-2k -^. 

k 

Choosing k '■— \y/5rs] confirms the result. □ 

The following result was used heavily in Section [7j It is of a fairly 
standard type and will be of no surprise to experts in additive combi- 
natorics, but we do not know of a convenient reference. 

Proposition A. 4. Suppose that A, B, C are three subsets of some 
abelian group G, all of cardinality within K of n. Suppose that K = 
o(n). Suppose that there at most Kn pairs (a,b) G A x B for which 
a + b ^ C . Then there is a subgroup H ^ G and cosets x + H,y + Hsuch 
that \AA(x + H)\, \BA(y + H)\, \CA(x + y + H)\^7K. 

As remarked in Section [7j results of this general type are quite fa- 
miliar to additive combinatorialists and are of the general form "an 
almost-group is close to a group" . We supply a complete proof here for 
the convenience of the reader. Variants of it are possible. For the most 
part the ideas are due to Kneser [2H Freiman [TS] and Fournier 

We first note that it is enough to prove the following weaker propo- 
sition, which may then be "cleaned up" to give the stated result . 

Proposition A. 5. Suppose that A,B,C are three subsets of some 
abelian group G, all of size within o(n) of n. Suppose that there at 
most o(n 2 ) pairs (a,b) G A x B for which a + b ^ C . Then there 
is a subgroup H ^ G and cosets x + H,y + H such that \AA{x + 
H)\, \BA(y + H)\, \CA(x + y + H)\ = o{n 2 ). 



Let us deduce Proposition A. 4 from this. Let A, B and C be as in the 



hypotheses of that proposition. Then Proposition A. 5 applies and we 
conclude that there is a subgroup H ^ G and cosets x + H,y + H such 
that \AA(x+H)\, \BA(y+H)\, \CA(x+y+H)\ = o(n 2 ). By translating 
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A and B if necessary we may assume without loss of generality that 
x — y = 0. 

Suppose that A = H\XUX', B = H\YUY' and C = H\ZUZ', with 
X,Y,Z C if, X',Y',Z' disjoint from if and all of X, X',Y,Y',Z, Z' 
having cardinality o(n). Now if a G X' then the elements a + 6, 6 G 
H \ X, are all distinct and none of them lie in if. Thus at most 
| if | — |X| — |Z'| = n — o(n) of these can lie in C, and thus we have 
|X'|(n — o(n)) ^ iTn, which implies that \X'\ ^ 2f\\ Similarly \Y'\ ^ 

Now note that, since \X\, \ Y\ < \H\/8, every element of H has at 
least 3 1 if 1 4 ^ n/2 representations as a sum a + b. By assumption, the 
set C contains all of these except those corresponding to at most Kn 
pairs (a, b), and so C contains at least \H\ — 2K elements of H. It 
follows that \Z\ ^2K. 

We have now demonstrated the inequalities 

\A\, \B\ < \H\+2K, \C\^\H\-2K. 
Since the sizes of A, B and C differ by at most K, we must in fact have 
\H\ - 3K < \A\, \B\, \C\ < |iJ| + 3K". 



This allows us to conclude that |Z'| ^ hK. Proposition A. 4 



follows immediately. □ 



We turn now to the task of proving Proposition A. 5 We require the 
following result, which could be deduced from results of Kneser 
and Freiman 1161. 



Lemma A. 6. Suppose that A is a subset of an abelian group G with 
\A\ = n, and suppose that \ A — A\ = n+o(n). Then there is a subgroup 
H ^ G and a coset x + H such that \A/\(x + H)\ = o(n). 

Proof. This is basically the argument of Fournier [17J. Write (cf. 
[37] ) Sym a (A) for the set of all d which have at least an representa- 
tions as at — a,2, ai,a 2 G A. Note that Sym 1 _ 5i (A) + Sym 1 _ (52 (A) C 
Sym 1 _ 5i _ (52 (yi), and note also that Sym 1 _ (5 (A) = n — o{n) for any 
5 ^ 1/6 (say). We claim that H = Sym 2 / 3 (A) is a group. Cer- 
tainly G H, and H + H C Sym 1 / 3 (v4), so all we need do is check 
that Sym 2 / 3 (A) = Sym 1 / 3 (v4). Suppose that d G Sym 1 / 3 (y4). Then 
d = ai — <22 in at least n/3 ways. If t G Sym 5 / 6 (A) then t = a!± — a' 2 in 
at least 5n/6 ways. For at least n/6 of these we will have a\ = a2 for 
some a 2 such that d = cti — a 2 , and thus d + 1 = (at — a 2 ) + (a\ — a' 2 ) = 
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ai — a 2 G A — A. That is, (d + Sym 5 / 6 (A)) D (A — A) ^ n/6. In par- 
ticular, d + Sym 5 / 6 (v4) intersects Sym 5 / 6 (A) (which has size n — o(n)) 
and therefore d G Sym 5 / 6 (v4) — Sym 5 / 6 (A) C Sym 2 / 3 (y4), as required. 

To see that A is close to a coset of H, note that a\ — ci2 G H for all 
but o(n 2 ) of the pairs (01,02) G A. In particular there is some x = 02 
such that ai G x + for all but o(n) values of a\ G A. □ 

We also need the following "99%" version of the Balog-Szemeredi- 
Gowers theorem. 

Lemma A. 7. Suppose that A and B are sets in some abelian group, 
and that T C A x B is a set with \T\ ^ (1 — e)|A||S|. Suppose that 
\A+ r B\ < (l + ^l^l 1 / 2 ^! 1 / 2 . Then there are sets A' C A and B' C B 
with \A'\/\A\, \B'\/\B\ ^ l-e' such that \A'-B'\ ^ (l + e')l^| 1/2 |5| 1/2 . 
We can take e' = 0(e c ) for some absolute constant C . 

Proof. This follows from [57] Theorem 2.31]. If one wanted instead the 
conclusion \A'+B'\ < (l+e')!^! 172 ^! 1 / 2 (for comparison with Theorem 



A.l) then one could additionally apply [37, Proposition 2.27]. □ 



With these lemmas in hand, we can conclude the proof of Proposition 
A. 5 With the hypotheses as in that proposition, first apply Lemma [A. 7 



to conclude that there are sets A',B' with |AAA'|, \BAB'\ = o(n) and 
[A* — B' I = n + o(n). Applying the Ruzsa triangle inequality we obtain 



\A'— A'\, \B'— B'\ = n+o(n). By Lemma A. 6 there are subgroups H, H' 
and cosets x + H, y + H' such that \AA(x + H)\, \BA(y + H')\ = o{n). 
In particular \H\, \H'\ = n + o(n). 

Now for all except o(n 2 ) pairs (a, b) G Ax B, a+b lies in a set C of size 
n+o{n). It follows easily that for all except o{n 2 ) pairs (h, h!) G HxH', 
h + h! lies in a set of size n + o(n). We claim that this forces H = H'. 
To see this, note that the assumption easily implies that there are 
n 3 — o(n 3 ) additive quadruples hi — h 2 = h[ — h' 2 , and so for all but 
o(n 2 ) pairs (hi, h 2 ) G H we have hi — h 2 = h[ — h 2 G H' . This implies 
that all but o(n) elements of H lie in H', and hence \HAH'\ = o(n) 
and so \H D H' \ = n — o(n). By Lagrange's theorem it follows that in 
fact H = H n H' = H' , as claimed. 

Finally, note that since A occupies at least 7\H\/% of x + H, and B 
at least 7\H\/8 of y + H, every element of x + y + H is a sum a + b in 
at least 3\H\/A > n/2 ways. It follows that C must contain all but at 
most o{n) of the elements of x + y + H , and this concludes the proofO 



The next result is due to Elekes, Nathanson and Ruzsa |14j. 
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Proposition A. 8. Let A C R be a set of cardinality n, and suppose 
that there are X\ < • • • < x\q such that f : R — > R is defined except 
possibly at x±, . . . ,x±o and is strictly concave or convex on each open 
interval (xi,Xi + i). Then either | A — A\ or \ f(A) — f(A)\ has cardinality 
at least cn 5 ^. 

Proof. Suppose first of all that / is strictly convex or concave on 
an interval containing A. Then by [HI Corollary 3.1] we have \A — 
A\\f(A) - f(A)\ > n 5 / 2 , and so either \A - A\ or \f(A) - f(A)\ has 
cardinality at least cn 5 / 4 . The proposition follows by applying this to 
the largest of the sets A { := A D (xj, x i+ \). □ 



Appendix B. Intersections of lines through roots of 

UNITY 



In this section we establish Proposition |7.5| Our arguments will be 
a crude variant of those used in [30] . 

It will be convenient to identify the plane M 2 with the complex num- 
bers C. Let LI n be the n th roots of unity, and suppose that p is a 
point other than the origin or an element of U n which is incident to 
m lines £i, . . . , £ m , each of which pass through two points e 2?r4/3j ' , e 2mij 
of LI n , where ^ . . . , (3 m , 71, . . . , 7 m < 1 are distinct rationals with 
denominator n. Our objective is to show that m = 0(1). 

We claim the identity 

|p _ g27ri/3j J I e 27ri/3j _ e 2ni-y k | 



\p — e ^ifik J I e 2vri/3 fe _ ^ni-yj | 

for any distinct 1 ^ j, k ^ m. Indeed, from elementary trigonom- 
etry we see that (p, e 27U/3j ', e 2mik ) and (p, e 2m)3k , e 27rJ7j ') form a pair of 
similar triangles, regardless of the relative ordering between the points 
involved. The right-hand side can be simplified as 

I sin(7r(/3 j - j k ))\ 



sin(7r(/3 fc - 7^ 



We conclude that 



sin(7r(/3j 


- 7fc)) sin 




-7i))l l sin 


>(A 


-7i))l 


| sin(7r(/3 fc 


-7i))l l sin 




- 7 fc ))| | sin 


>(& 


-70)1 



for any distinct 1 ^ j, k ^ m, and thus 

sin(7r(/3 j - 7*.)) sin(7r(/3 fc - 7)) sin(7r(A - 7,)) 

= ± sin(7r(/3 fc - 7,-)) sin(7r(A - 7*.)) sin(7r(/3 j - 7)) 
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for some choice of sign ±. Actually, we claim that the sign here is 
always given by the + sign. To see this, let us temporarily forget that 
the e 2nll3j , e 2mik were constrained to be roots of unity, and that p was 
assumed not to take values at the origin or at infinity (since we have 
not yet actually used these hypotheses). We first observe that the sign 
does not change if we shift any of the pj or 7^ by an integer, so we 
may assume that these phases take values in R/Z rather than [0, 1). 
Then we observe that the sign is stable with respect to continuous 
perturbations of the pj,jk an d p, so long as no two phases cross each 
other, and that e 2wi ^ , e 2ni ^ , p remain collinear for all j . From this we 
may reduce to the case when p is at the origin (so that pj = jj + 1/2 
for all j) or at, say, [1,0,0] (so that (3j = 1/2 — jj for all j), and the 
sign is easily verified in these cases. 

Expanding out sinx as (e lx — e~ lx )/2i, we conclude that 

(^iiPi-lh) _ e 7I "*(7fc-/3j)^ e i"«(/3fc-7;) _ e T*(7t-/3*)V e T*(A-7i) _ ^Hrij-M\ 

Multiplying out both sides, and cancelling the common terms 

appearing on both sides, one arrives at an identity of the form 

12 

^e r e™^=0, (B.l) 

r=l 

where the e r = ±1 are signs depending only on r, and a r -,j,k,i are twelve 
linear combinations of pj, (3k, Pi, Jj,Jk, Ji, each of the form 

a r . jA i = ±(3j ±p k ±Pt± jj ± 7^ ± 7; 

where the six signs ± do not need to be equal, but depend only on the 
index r. These signs can of course be worked out explicitly, but we 
will not need to do so here, save to note that the linear forms a r -,j,k,i 
are all distinct in r, thus a r -,j,k,i — fVy,fc,2 is a non-trivial combination 
of Pj, (3k, Pi, 1j, Ik, li whenever r, r' are distinct. 

Now we reinstate the hypothesis that the e 27rl/3j , e 2mik are nth roots 
of unity, which ensures that the e r e mar '^' k ' 1 are also nth roots of unity. 
The sets of twelve nth roots of unity that sum to zero were completely 
classified in [30, Theorem 3]. The exact classification is somewhat 
messy, but we only require the following qualitative consequence of it. 

Proposition B.l. There exists a finite set S of roots of unity with 
the property that whenever e 2mai , . . . , e 2 ™ 12 are roots of unity with 
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J2 r =i e 2moir = 0, then one has e 2ml - ar ">-') G S for some 1 ^ r < r' < 
12. 

Indeed, one can take S to be the ratios of the roots of unity arising 
in the minimal relations of weight up to 12 that were classified in [5tH 
Theorem 3], the key point being that there were only finitely many 
(107, to be precise) such relations up to rotation. In fact one can take 
S to equal the 3030-element set n 2X 3 X 5 X7x ii if desired. 

Applying this proposition, we conclude that for any distinct k, I 
with 1 ^ j, k, I ^ m, one has 

for some r, r' with l^r<r'^12 and some choice of sign ±. Applying 
the pigeonhole principle, we conclude (for m large enough) that there 
exist r, r' with 1 ^ r < r' ^ 12 and a phase 9 such that 

a r;j,k,l — a r';j,k,l = 

for ^> m 3 triples of distinct 1 ^ j, k, I ^ m. 

Fix r,r',9 as above. As mentioned earlier, a r ;j,k,i ~ a r>-,j,k,i is a non- 
trivial linear form in the fy, fa, Pi, Jj, 7it, 7/- By symmetry, we may 
then assume that at least one of the /3j,jj coefficients in this form are 
non-zero. As these coefficients lie in {—2, 0, +2}, we may thus write 

®r;j,k,l - Ot r >;j,k,l = 2a/3j + 2^ + C k) l 

for some coefficients a, b G { — 1, 0, +1} not both zero, and some phases 
Ckj independent of j. . Note that these coefficients are equal to —2, 0, 
or +2. Pigeonholing in the k, I, we may then find distinct 1 ^ k, I ^ m 
and a phase 9' such that 

a(3j + b^j = 9' 

for ^> m values of j. But from elementary trigonometry, and the 
hypothesis that p is not at the origin (or at infinity) we see that as 
/3,7 G [0,1) range over the distinct phases for which e 27ri/3 , e 2mi are 
concurrent with p, the phase a/3 + by can take on any specific value 9' 
at most 0(1) times, and so m = 0(1) as desired. This concludes the 



proof of Proposition 7.5 □ 



Remark. In principle, an explicit computational analysis of the min- 
imal relations that were classified in [30l Theorem 3] should yield the 



optimal value of C in Proposition 7.5 In [30] it was shown that one 



can take C = 7 if one restricts p to be in the interior of the circle, 
and it is likely that the same bound holds in the exterior region also. 
However, we will not perform this computation here. 
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We now give a weaker version of Proposition 7J3 which is completely 
elementary. In particular, it avoids the Poonen-Rubinstein classifica- 
tion of tuples of twelve roots of unity summing to zero. It can be used 



as a substitute for that proposition in the proof of Theorem 1.2 (and 



hence Theorem 2.2), but not in the stronger Theorem 2.4 



Proposition B.2. Let Il n C C = M 2 denote the regular n-gon consist- 
ing of the n th roots of unity. Then no point other than the origin lies 
on more than 0(n 5 ^ 6 ) lines joining pairs of vertices ofU n . 



Proof. The argument here will be similar to that used at the end of 
Section [7j with the roots of unity Il n playing the role of the coset H®x 
in that analysis. 

Let p be a point other than the origin. Let the vertices of Il n be 
v i, . . . , v n in order. Suppose that the line connecting p and Vj meets the 
line at infinity in the point [— sin 7r8j , cos ndj , 0] . As j ranges between 
1 and n, the 9j can be taken to be an increasing sequence in [0, 2]. 

A Euclidean geometry exercise, left as an exercise to the reader, 
confirms the following claim: for any fixed integer a and for any 
there are at most two values of j such that 6j +a — 9j = 0. 

Suppose that p lies on 5n lines joining pairs of vertices of II n . If Vj is 
one such vertex, then from elementary trigonometry we see that ndj is 
an integer. Thus there is a set J,\J\ = 5n, such that all nQj are integers 
in {1, . . . , 2n} for j G J. Split {1, . . . , 2n} into m ~ |5n intervals of 
length ~ 10/5, and suppose that the number of points of J in these 
intervals is Ni, . . . , N m . Since Ni + ■ ■ ■ + N m = 5n, the Cauchy-Schwarz 
inequality implies that iV 2 + • • • + ^ 5 2 n 2 /m ^ 55n. On the other 
hand this sum is at most the number of pairs in J x J differing by at 
most 10/5. The contribution from the diagonal (pairs (j, j)) is just 5n, 
and so there are at least A5n pairs in J x J differing by at most 10/5. 
By the pigeonhole principle there is some integer a, < a ^ 10/5, 
such that j, j + a G J for ^> 5 2 n values of j. From this sequences of 
js, we may then extract a subsequence ji < • ■ ■ < jd, d ^> 5 3 n, with 
ji+i > ji + a ) suc h that once again ji, ji + a G J for each i. 

Write Xi := n(9j. +a — 9j.). Then, since the 6j are increasing as a 
function of j, all the positive. Furthermore we have x±+- ■ -+Xd ^ 
n. However the all integers, and no integer can occur more than 

twice as a value of X{ by the claim we established at the start of the 
proof. From this it follows that d \fn. 

Comparing these inequalities yields 5 n~ 1//6 , and this completes 
the proof. □ 
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